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NEISSERIA MENINGITIDIS ANTIGENS AND COMPOSITIONS 

This application is a continuation-in-part of the following U.S. Provisional Patent 
applications, from each of which priority is claimed, and each of which is incorporated by 
reference in its entirety: 60/083,758 (filed May 1, 1998); 60/094,869 (filed July 31, 1998); 
60/098,994 (filed September 2, 1998); 60/099,062 (filed September 2, 1998); 60/103,749 
(filed October 9, 1998); 60/103,794 (filed October 9, 1998); 60/103,796 (filed October 9, 
1998); and 60/121,528 (filed February 25, 1999). 

This invention relates to antigens from the bacterial species: Neisseria meningitidis 
and Neisseria gonorrhoeae. 



BACKGROUND 

Neisseria meningitidis is a non-motile, gram negative diplococcus human pathogen. It 
colonizes the pharynx, causing meningitis and, occasionally, septicaemia in the absence of 
meningitis. It is closely related to N gonorrhoea, although one feature that clearly 
differentiates meningococcus from gonococcus is the presence of a polysaccharide capsule 
that is present in all pathogenic meningococci. 

N meningitidis causes both endemic and epidemic disease. In the United States the 
attack rate is 0.6-1 per 100,000 persons per year, and it can be much greater during outbreaks, 
(see Lieberman et al (1996) Safety and Immunogenicity of a Serogroups A/C Neisseria 
meningitidis Oligosaccharide-Protein Conjugate Vaccine in Young Children. JAMA 
275(19):1499-1503; Schuchat et al (1997) Bacterial Meningitis in the United States in 1995. 
NEnglJMed 337(14):970-976). In developing countries, endemic disease rates are much 
higher and during epidemics incidence rates can reach 500 cases per 100,000 persons per 
year. Mortality is extremely high, at 10-20% in the United States, and much higher in 
developing countries. Following the introduction of the conjugate vaccine against 
Haemophilus influenzae, N meningitidis is the major cause of bacterial meningitis at all ages 
in the United States (Schuchat et al (1997) supra). 

Based on the organism's capsular polysaccharide, 12 serogroups of A^. meningitidis 
have been identified. Group A is the pathogen most often implicated in epidemic disease in 
sub-Saharan Afiica. Serogroups B and C are responsible for the vast majority of cases in the 
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United States and in most developed countries. Serogroups W135 and Y are responsible for 
the rest of the cases in the United States and developed countries. The meningococcal vaccine 
currently in use is a tetravalent polysaccharide vaccine composed of serogroups A, C, Y and 
W135. Although efficacious in adolescents and adults, it induces a poor immune response and 
short duration of protection, and cannot be used in infants [eg. Morbidity and Mortality 
weekly report, Vol.46, No. RR-5 (1997)]. This is because polysaccharides are T-cell 
independent antigens that induce a weak immune response that cannot be boosted by repeated 
immunization. Following the success of the vaccination 2igainst Kinfluenzae, conjugate 
vaccines against serogroups A and C have been developed and are at the final stage of clinical 
testing (Zollinger WD "New and Improved Vaccines Against Meningococcal Disease". In: 
New Generation Vaccines, supra, pp. 469-488; Lieberman et al (1996) supra\ Costantino et al 
(1992) Development and phase I clinical testing of a conjugate vaccine against 
meningococcus A and C. Vaccine 10:691-698). 

Meningococcus B (menB) remains a problem, however. This serotype currently is 
responsible for approximately 50% of total meningitis in the United States, Europe, and South 
America. The polysaccharide approach cannot be used because the menB capsular 
polysaccharide is a polymer of a(2-8)-linked //-acetyl neuraminic acid that is also present in 
mammalian tissue. This results in tolerance to the antigen; indeed, if an immune response 
were elicited, it would be anti-self, and therefore undesirable. In order to avoid induction of 
autoimmunity and to induce a protective immune response, the capsular polysaccharide has, 
for instance, been chemically modified substituting the A^-acetyl groups with iV-propionyl 
groups, leaving the specific antigenicity unaltered (Romero & Outschoom (1994) Current 
status of Meningococcal group B vaccine candidates: capsular or non-capsular? Clin 
Microbiol Rev 7(4):559-575). 

Alternative approaches to menB vaccines have used complex mixtures of outer 
membrane proteins (OMPs), containing either the OMPs alone, or OMPs enriched in porins, 
or deleted of the class 4 OMPs that are believed to induce antibodies that block bactericidal 
activity. This approach produces vaccines that are not well characterized. They are able to 
protect against the homologous strain, but are not effective at large where there are many 
antigenic variants of the outer membrane proteins. To overcome the antigenic variabiUty, 
multivalent vaccines containing up to nine different porins have been constructed {eg, 
Poolman JT (1992) Development of a meningococcal vaccine. Infect Agents Dis. 4:13-28). 
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Additional proteins to be used in outer membrane vaccines have been the opa and opc 
proteins, but none of these approaches have been able to overcome the antigenic variability 
(eg. Ala'Aldeen & Borriello (1996) The meningococcal transferrin-binding proteins 1 and 2 
are both surface exposed and generate bactericidal antibodies capable of killing homologous 
and heterologous strains. Vaccine 14(l):49-53). 

A certain amount of sequence data is available for meningococcal and gonoccocal 
genes and proteins (e.g. EP-A-0467714, W096/29412), but this is by no means complete. 
Other men B proteins may include those listed in WO 97/28273, WO 96/29412, WO 
95/03413, US 5,439,808, and US 5,879,686. 

The provision of further sequences could provide an opportunity to identify secreted 
or surface-exposed proteins that are presumed targets for the immune system and which are 
not antigenically variable. For instance, some of the identified proteins could be components 
of efficacious vaccines against meningococcus B, some could be components of vaccines 
against all meningococcal serotypes, and others could be components of vaccines against all 
pathogenic Neisseriae including Neisseria meningitidis or Neisseria gonorrhoeae. Those 
sequences specific to N meningitidis or N gonorrhoeae that are more highly conserved are 
further preferred sequences. 

It is thus an object of the invention is to provide Neisserial DNA sequences which 
encode proteins that are antigenic or immunogenic. 

BRIEF DESCMPTION OF THE DRAWINGS 

Fig. 1 illustrates the products of protein expression and purification of the predicted 
ORF 919 as cloned and expressed in E. coli. 

Fig. 2 illustrates the products of protein expression and purification of the predicted 
ORF 279 as cloned and expressed in E. coli. 

Fig. 3 illustrates the products of protein expression and purification of the predicted 
ORF 576-1 as cloned and expressed inE, coli. 

Fig. 4 illustrates the products of protein expression and purification of the predicted 
ORF 519-1 as cloned and expressed in E. coli. 

Fig. 5 illustrates the products of protein expression and purification of the predicted 
ORF 121-1 as cloned and expressed in E. coli. 
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Fig. 6 illustrates the products of protein expression and purification of the predicted 
ORF 128-1 as cloned and expressed in E. colL 

Fig. 7 illustrates the products of protein expression and purification of the predicted 
ORF 206 as cloned and expressed in£. co/z. 

Fig. 8 illustrates the products of protein expression and purification of the predicted 
ORF 287 as cloned and expressed in E, coli. 

Fig. 9 illustrates the products of protein expression and purification of the predicted 
ORF 406 as cloned and expressed in E. coli. 

Fig. 10 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 919 as cloned and expressed in E, colL 

Fig. 1 1 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 279 as cloned and expressed in E, coli. 

Fig. 12 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 576-1 as cloned and expressed in E. coli. 

Fig. 13 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 519-1 as cloned and expressed in E, coli. 

Fig. 14 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 121-1 as cloned and expressed in E. coli. 

Fig. 15 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 128-1 as cloned and expressed in E. coli. 

Fig. 16 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 206 as cloned and expressed in E. coli. 

Fig. 17 illustrates the hydrophiUcity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 287 as cloned and expressed in E. coli. 

Fig. 1 8 illustrates the hydrophiUcity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 406 as cloned and expressed in E. coli. 

Fig. 19 shows an alignment comparison of amino acid sequences for ORF 225 for 
several strains of Neisseria, Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 



wo 99/57280 




PCT/US99/09346 



5 



Fig. 20 shows an alignment comparison of amino acid sequences for ORF 235 for 
several strains of Neisseria, Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 21 shows an alignment comparison of amino acid sequences for ORF 287 for 
several strains of Neisseria, Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids v/ith similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 22 shows an alignment comparison of amino acid sequences for ORF 519 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 23 shows an alignment comparison of amino acid sequences for ORF 919 for 
several strains of Neisseria, Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

THE INVENTION 

The invention provides proteins comprising the N meningitidis amino acid sequences 
and N, gonorrhoeae amino acid sequences disclosed in the examples. 

It also provides proteins comprising sequences homologous (i.e., those having 
sequence identity) to the A^. meningitidis amino acid sequences disclosed in the examples. 
Depending on the particular sequence, the degree of homology (sequence identity) is 
preferably greater than 50% (eg 60%, 70%, 80%, 90%, 95%, 99% or more). These proteins 
include mutants and allelic variants of the sequences disclosed in the examples. Typically, 
50% identity or more between two proteins is considered to be an indication of fiinctional 
equivalence. Identity between proteins is preferably determined by the Smith-Waterman 
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homology search algorithm as implemented in MPSRCH program (Oxford Molecular) using 
an affme gap search with parameters:gap penalty 12, gap extension penalty 1 . 

The invention fiirther provides proteins comprising fragments of the//, meningitidis 
amino acid sequences and A^. gonorrhoeae amino acid sequences disclosed in the examples. 
The fragments shoiild comprise at least n consecutive amino acids from the sequences and, 
depending on the particular sequence, n is 7 or more (eg. 8, 10, 12, 14, 16, 18, 20 or more). 
Preferably the fragments comprise an epitope from the sequence. 

The proteins of the invention can, of course, be prepared by various means (eg, 
recombinant expression, purification from cell culture, chemical synthesis etc) and in various 
forms {eg, native, fusions etc.). They are preferably prepared in substantially pure or isolated 
form (ie, substantially free from other M meningitidis otN. gonorrhoeae host cell proteins) 

According to a further aspect, the invention provides antibodies which bind to these 
proteins. These may be polyclonal or monoclonal and may be produced by any suitable 
means. 

According to a fiirther aspect, the invention provides nucleic acid comprising the 
K meningitidis nucleotide sequences and A^. gonorrhoeae nucleotide sequences disclosed in 
the examples. 

According to a further aspect, the invention comprises nucleic acids having sequence 
identity of greater than 50% (e.g., 60%, 70%, 80%, 90%, 95%, 99% or more) to the nucleic 
acid sequences herein. Sequence identity is determined as above-discussed. 

According to a further aspect, the invention comprises nucleic acid that hybridizes to 
the sequences provided herein. Conditions for hybridization are set forth herein. 

Nucleic acid comprising fragments of these sequences are also provided. These should 
comprise at least n consecutive nucleotides from the meningitidis sequences or N, 
gonorrhoeae sequences and depending on the particular sequence, w is 10 or more (eg 12, 14, 
15, 18, 20, 25, 30, 35, 40 or more). 

According to a further aspect, the invention provides nucleic acid encoding the 
proteins and protein fragments of the invention. 

It should also be appreciated that the invention provides nucleic acid comprising 
sequences complementary to those described above (eg for antisense or probing purposes). 

Nucleic acid according to the invention can, of course, be prepared in many ways (eg 
by chemical synthesis, in part or in whole, from genomic or cDNA libraries, from the 
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organism itself e/c.) and can take various forms {eg. single stranded, double stranded, vectors, 
probes eta). 

In addition, the term "nucleic acid" includes DNA and RNA, and also their analogues, 
such as those containing modified backbones, and also protein nucleic acids (PNA) eta 

According to a further aspect, the invention provides vectors comprising nucleotide 
sequences of the invention (eg, expression vectors) and host cells transformed with such 
vectors. 

According to a further aspect, the invention provides compositions comprising protein, 
antibody, and/or nucleic acid according to the invention. These compositions may be suitable 
as vaccines, for instance, or as diagnostic reagents or as immunogenic compositions. 

The invention also provides nucleic acid, protein, or antibody according to the 
invention for use as medicaments (eg. as vaccines) or as diagnostic reagents. It also provides 
the use of nucleic acid, protein, or antibody according to the invention in the manufacture of 
(I) a medicament for treating or preventing infection due to Neisserial bacteria (ii) a 
diagnostic reagent for detecting the presence of Neisserial bacteria or of antibodies raised 
against Neisserial bacteria or (iii) for raising antibodies. Said Neisserial bacteria may be any 
species or strain (such as K gonorrhoeae) but are preferably A^. meningitidis, especially strain 
B or strain C. 

The invention also provides a method of treating a patient, comprising administering 
to the patient a therapeutically effective amount of nucleic acid, protein, and/or antibody 
according to the invention. 

According to further aspects, the invention provides various processes. 

A process for producing proteins of the invention is provided, comprising the step of 
culturing a host cell according to the invention imder conditions which induce protein 
expression. 

A process for detecting polynucleotides of the invention is provided, comprising the 
steps of: (a) contacting a nucleic probe according to the invention with a biological sample 
under hybridizing conditions to form duplexes; and (b) detecting said duplexes. 

A process for detecting proteins of the invention is provided, comprising the steps of: 
(a) contacting an antibody according to the invention with a biological sample under 
conditions suitable for the formation of an antibody-antigen complexes; and (b) detecting 
said complexes. 
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Having now generally described the inventiion, the same will be more readily 
understood through reference to the following examples which are provided by way of 
illustration, and are not intended to be limiting of the present invention, unless specified. 

Methodology - Summary of standard procedures and techniques. 
General 

This invention provides Neisseria meningitidis menB nucleotide sequences, amino 
acid sequences encoded therein. With these disclosed sequences, nucleic acid probe assays 
and expression cassettes and vectors can be produced. The expression vectors can be 
transformed into host cells to produce proteins. The purified or isolated polypeptides (which 
may also be chemically synthesized) can be used to produce antibodies to detect menB 
proteins. Also, the host cells or extracts can be utilized for biological assays to isolate 
agonists or antagonists. In addition, vdth these sequences one can search to identify open 
reading firames and identify amino acid sequences. The proteins may also be used in 
immunogenic compositions, antigenic compositions and as vaccine components. 

The practice of the present invention will employ, imless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA, and 
immunology, which are vdthin the skill of the art. Such techniques are explained fully in the 
literature e.g., Sambrook Afo/ecw/ar Cloning; A Laboratory Manual Second Edition (1989); 
DNA Cloning, Volumes land ii (D.N Glover ed. 1985); Oligonucleotide Synthesis (M.J, Gait 
ed, 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds. 1984); Transcription 
and Translation (B.D. Hames & S.J. Higgins eds. 1984); Animal Cell Culture (R.I. Freshney 
ed. 1986); Immobilized Cells and Enzyme (IRL Press, 1986); B. Perbal, A Practical Guide to 
Molecular Cloning (1984); the Methods in Enzymology series (Academic Press, Inc.), 
especially volumes 154 & 155; Gene Transfer Vectors for Mammalian Cells (J.H. Miller and 
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); Mayer and Walker, eds. (1987), 
Immunochemical Methods in Cell and Molecular Biology (Academic Press, London); Scopes, 
(1987) Protein Purification: Principles and Practice, Second Edition (Sprmger-Veriag, 
N.Y.), and Handbook of Experimental Immunology, Volumes MV (D.M. Weir and C.C. 
Blackwell eds 1986). 

Standard abbreviations for nucleotides and amino acids are used in this specification. 
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All publications, patents, and patent applications cited herein are incorporated in full 
by reference. 

Expression systems 

The Neisseria menB nucleotide sequences can be expressed in a variety of different 
expression systems; for example those used with mammalian cells, plant cells, baculoviruses, 
bacteria, and yeast. 

i. Mammalian Systems 

Mammalian expression systems are known in the art. A mammahan promoter is any 
DNA sequence capable of binding mammalian RNA polymerase and initiating the 
downstream (3*) transcription of a coding sequence (e.g., structural gene) into mRNA. A 
promoter will have a transcription initiating region, which is usually placed proximal to the 5* 
end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream 
of the transcription initiation site. The TATA box is thought to direct RNA polymerase II to 
begin RNA synthesis at the correct site. A mammalian promoter will also contain an 
upstream promoter element, usually located within 100 to 200 bp upstream of the TATA box. 
An upstream promoter element determines the rate at which transcription is initiated and can 
act in either orientation (Sambrook et al. (1989) "Expression of Cloned Genes in Mammalian 
Cells." In Molecular Cloning: A Laboratory Manual, 2nd ed). 

Mammalian viral genes are often highly expressed and have a broad host range; 
therefore sequences encoding mammalian viral genes provide particularly usefiil promoter 
sequences. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus promoter. In 
addition, sequences derived from non-viral genes, such as the murine metallothionein gene, 
also provide useful promoter sequences. Expression may be either constitutive or regulated 
(inducible). Depending on the promoter selected, many promotes may be inducible using 
known substrates, such as the use of the mouse mammary tumor virus (MMTV) promoter 
with the glucocorticoid responsive element (ORE) that is induced by glucocorticoid in 
hormone-responsive transformed cells (see for example, U.S. Patent 5,783,681). 

The presence of an enhancer element (enhancer), combined with the promoter 
elements described above, will usually increase expression levels. An enhancer is a 
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regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to 
homologous or heterologous promoters, with synthesis beginning at the normal RNA start 
site. Enhancers are also active when they are placed upstream or downstream from the 
transcription initiation site, in either normal or flipped orientation, or at a distance of more 
than 1000 nucleotides from the promoter (Maniatis et al. (1987) Science 236:1231 \ Alberts et 
al. (1989) Molecular Biology of the Cell, 2nd ed.). Enhancer elements derived from viruses 
may be particularly useful, because they usually have a broader host range. Examples include 
the SV40 early gene enhancer (Dijkema et al (1985) EMBO J, 4:161) and the 
enhancer/promoters derived from the long terminal repeat (LTR) of the Rous Sarcoma Virus 
(Gorman et al. (1982b) Proc. Natl Acad Set 79:6111) and from human cytomegalovirus 
(Boshart et al. (1985) Cell 41:521), Additionally, some enhancers are regulatable and become 
active only in the presence of an inducer, such as a hormone or metal ion (Sassone-Corsi and 
Borelli (1986) Trends Genet, 2:215; Maniatis et al. (1987) Science 236:1237). 

A DNA molecule may be expressed intracellularly in mammalian cells. A promoter 
sequence may be directly linked with the DNA molecule, in which case the first amino acid at 
the N-terminus of the recombinant protein will always be a methionine, which is encoded by 
the ATG start codon. If desired, the N-terminus may be cleaved from the protein by in vitro 
incubation with cyanogen bromide. 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion of the foreign protein in manmialian cells. 
Preferably, there are processing sites encoded between the leader fragment and the foreign 
gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually 
encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of 
the protein from the cell. The adenovirus tripartite leader is an example of a leader sequence 
that provides for secretion of a foreign protein in mammalian cells. 

Usually, transcription termination and polyadenylation sequences recognized by 
mammalian cells are regulatory regions located 3* to the translation stop codon and thus, 
together with the promoter elements, flank the coding sequence. The 3* terminus of the 
mature mRNA is formed by site-specific post-transcriptional cleavage and polyadenylation 
(Bimstiel et al. (1985) Cell 41:349; Proudfoot and Whitelaw (1988) "Termination and 3* end 
processing of eukaryotic RNA. In Transcription and splicing (ed. B.D. Hames and D.M. 
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Glover); Proudfoot (1989) Trends Biochem, Sci. J 4: 105). These sequences direct the 
transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. 
Examples of transcription terminator/polyadenylation signals include those derived from 
SV40 (Sambrook et al (1989) "Expression of cloned genes in cultured mammalian cells." In 
Molecular Cloning: A Laboratory Manual), 

Usually, the above described components, comprising a promoter, polyadenylation 
signal, and transcription termination sequence are put together into expression constructs. 
Enhancers, introns with functional splice donor and acceptor sites, and leader sequences may 
also be included in an expression construct, if desired. Expression constructs are often 
maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of 
stable maintenance in a host, such as mammalian cells or bacteria. Mammalian replication 
systems include those derived from animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the replication systems of papovaviruses, such as 
SV40 (Gluzman (1981) Cell 25:175) or polyomavirus, replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. Additional examples of 
mammalian replicons include those derived from bovine papillomavirus and Epstein-Barr 
virus. Additionally, the repUcon may have two replication systems, thus allowing it to be 
maintained, for example, in mammalian cells for expression and in a prokaryotic host for 
cloning and ampUfication. Examples of such mammalian-bacteria shuttle vectors include 
pMT2 (Kauftnan et al. (1989) Mol. Cell Biol P:946) and pHEBO (Shimizu et al. (1986) Mol 
Cell Biol (5:1074). 

The transformation procedure used depends upon the host to be transformed. Methods 
for introduction of heterologous polynucleotides into mammalian cells are known in the art 
and include dextran-mediated transfection, calcium phosphate precipitation, polybrene 
mediated transfection, protoplast fiision, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

Mammalian cell lines available as hosts for expression are known in the art and 
include many immortalized cell Unes available from the American Type Culture Collection 
(ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby 
hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma 
cells (e.g.. Hep G2), and a number of other cell lines. 
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ii. Plant Cellular Expression Systems 

There are many plant cell culture and whole plant genetic expression systems known 
in the art. Exemplary plant cellular genetic expression systems include those described in 
patents, such as: U.S. 5,693,506; US 5,659,122; and US 5,608,143. Additional examples of 
genetic expression in plant cell culture has been described by Zenk, Phytochemistry 30:3861- 
3863 (1991). Descriptions of plant protein signal peptides may be found in addition to the 
references described above in Vaulcombe et al., Mol Gen, Genet, 209:33-40 (1987); Chandler 
et al. Plant Molecular Biology 3:407-418 (1984); Rogers, J. Biol Chem. 260:3731-3738 
(1985); Rothstein et al., Gene 55:353-356 (1987); Whittier et al., Nucleic Acids Research 
15:25 15-2535 (1987); Wirsel et al.. Molecular Microbiology 3:3-14 (1989); Yu et al.. Gene 
122:247-253 (1992). A description of the regulation of plant gene expression by the 
phytohormone, gibberellic acid and secreted enzymes induced by gibberellic acid can be 
found in R.L. Jones and J. MacMillin, Gibberellins: in: Advanced Plant Physiology,. Malcolm 
B. Wilkins, ed., 1984 Pitman Publishing Limited, London, pp. 21-52. References that 
describe other metabolically-regulated genes: Sheen, Plant Cell, 2:1027-1038(1990); Maas et 
al., EMBOJ. 9:3447-3452 (1990); Benkel and Hickey, Proc. Natl Acad. Set. 84:1337-1339 
(1987) 

Typically, using techniques known in the art, a desired polynucleotide sequence is 
inserted into an expression cassette comprising genetic regulatory elements designed for 
operation in plants. The expression cassette is inserted into a desired expression vector with 
companion sequences upstream and downstream from the expression cassette suitable for 
expression in a plant host. The companion sequences will be of plasmid or viral origin and 
provide necessary characteristics to the vector to pemiit the vectors to move DNA from an 
original cloning host, such as bacteria, to the desired plant host. The basic bacterial/plant 
vector construct will preferably provide a broad host range prokaryote replication origin; a 
prokaryote selectable marker; and, for Agrobacterium transformations, T DNA sequences for 
Agrobacterium-mediated transfer to plant chromosomes. Where the heterologous gene is not 
readily amenable to detection, the construct will preferably also have a selectable marker gene 
suitable for determining if a plant cell has been transformed. A general review of suitable 
markers, for example for the members of the grass family, is found in Wilmink and Dons, 
1993, Plant Mol Biol Reptr, 11(2):165-185. 
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Sequences suitable for permitting integration of the heterologous sequence into the 
plant genome are also reconunended. These might include transposon sequences and the like 
for homologous recombination as well as Ti sequences which permit random insertion of a 
heterologous expression cassette into a plant genome. Suitable prokaryote selectable markers 
include resistance toward antibiotics such as ampicillin or tetracycline. Other DNA sequences 
encoding additional functions may also be present in the vector, as is known in the art. 

The nucleic acid molecules of the subject invention may be included into an 
expression cassette for expression of the protein(s) of interest. Usually, there will be only one 
expression cassette, although two or more are feasible. The recombinant expression cassette 
will contain in addition to the heterologous protein encoding sequence the following 
elements, a promoter region, plant 5' untranslated sequences, initiation codon depending upon 
whether or not the structural gene comes equipped with one, and a transcription and 
translation termination sequence. Unique restriction enzyme sites at the 5' and 3* ends of the 
cassette allow for easy insertion into a pre-existing vector. 

A heterologous coding sequence may be for any protein relating to the present 
invention. The sequence encoding the protein of interest will encode a signal peptide which 
allows processing and translocation of the protein, as appropriate, and will usually lack any 
sequence which might result in the binding of the desired protein of the invention to a 
membrane. Since, for the most part, the transcriptional initiation region will be for a gene 
which is expressed and translocated during germination, by employing the signal peptide 
which provides for translocation, one may also provide for translocation of the protein of 
interest. In this way, the protein(s) of interest will be translocated from the cells in which they 
are expressed and may be efficiently harvested. Typically secretion in seeds are across the 
aleurone or scutellar epithelium layer into the endosperm of the seed. While it is not required 
that the protein be secreted from the cells in which the protein is produced, this facilitates the 
isolation and purification of the recombinant protein. 

Since the ultimate expression of the desired gene product will be in a eucaryotic cell it 
is desirable to determine whether any portion of the cloned gene contains sequences which 
will be processed out as introns by the host's splicosome machinery. If so, site-directed 
mutagenesis of the "intron" region may be conducted to prevent losing a portion of the genetic 
message as a false intron code, Reed and Maniatis, Cell 41:95-105, 1985. 
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The vector can be microinjected directly into plant cells by use of micropipettes to 
mechanically transfer the recombinant DNA. Crossway, A/b/. Gen, Genet, 202:179-185, 
1985. The genetic material may also be transferred into the plant cell by using polyethylene 
glycol, Krens, et al. Nature, 296, 72-74, 1982. Another method of introduction of nucleic 
acid segments is high velocity ballistic penetration by small particles with the nucleic acid 
either within the matrix of small beads or particles, or on the surface, Klein, et al.. Nature, 
327, 70-73, 1987 and Knudsen and MuUer, 1991, Planta, 185:330-336 teaching particle 
bombardment of barley endosperm to create transgenic barley. Yet another method of 
introduction would be fusion of protoplasts with other entities, either minicells, cells, 
lysosomes or other fusible lipid-surfaced bodies, Fraley, et al.. Proa Natl Acad. Set USA, 79, 
1859-1863, 1982. 

The vector may also be introduced into the plant cells by electroporation. (Fromm et 
al., Proc. Natl Acad, ScL USA 82:5824, 1985). In this technique, plant protoplasts are 
electroporated in die presence of plasmids containing the gene construct. Electrical impulses 
of high field strength reversibly permeabilize biomembranes allowing the introduction of the 
plasmids. Electroporated plant protoplasts reform the cell wall, divide, and form plant callus. 

All plants fi-om which protoplasts can be isolated and cultured to give whole 
regenerated plants can be transformed by the present invention so that whole plants are 
recovered which contain the transferred gene. It is known that practically all plants can be 
regenerated from cultured cells or tissues, including but not limited to all major species of 
sugarcane, sugar beet, cotton, fruit and other trees, legumes and vegetables. Some suitable 
plants include, for example, species from the genera Fragaria, Lotus, Medicago, Onobrychis, 
Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, 
Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersion, 
Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Hererocallis, Nemesia, Pelargonium, Panicum, Pennisetum, 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Lolium, Zea, Triticum, 
Sorghum, md Datura, 

Means for regeneration vary from species to species of plants, but generally a 
suspension of transformed protoplasts containing copies of the heterologous gene is first 
provided. Callus tissue is formed and shoots may be induced from callus and subsequently 
rooted. Altematively, embryo formation can be induced from the protoplast suspension. 
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These embryos germinate as natural embryos to form plants. The culture media will generally 
contain various amino acids and hormones, such as auxin and cytokinins. It is also 
advantageous to add glutamic acid and proline to the medium, especially for such species as 
com and alfalfa. Shoots and roots normally develop simultaneously. Efficient regeneration 
will depend on the medium, on the genotype, and on the history of the culture. If these three 
variables are controlled, dien regeneration is fully reproducible and repeatable. 

In some plant cell culture systems, the desired protein of the invention may be 
excreted or alternatively, the protein may be extracted from the whole plant. Where the 
desired protein of the invention is secreted into the medixmi, it may be collected. 
Alternatively, the embryos and embryoless-half seeds or other plant tissue may be 
mechanically disrupted to release any secreted protein between cells and tissues. The mixture 
may be suspended in a buffer solution to retrieve soluble proteins. Conventional protein 
isolation and purification methods will be then used to purify the recombinant protein. 
Parameters of time, temperature pH, oxygen, and volumes will be adjusted through routine 
methods to optimize expression and recovery of heterologous protein. 

iii. Baculovirus Systems 

The polynucleotide encoding the protein can also be inserted into a suitable insect 
expression vector, and is operably linked to the control elements within that vector. Vector 
construction employs techniques which are known in ttie art. Generally, the components of 
the expression system include a transfer vector, usually a bacterial plasmid, which contains 
both a fragment of the baculovirus genome, and a convenient restriction site for insertion of 
the heterologous gene or genes to be expressed; a wild type baculovirus with a sequence 
homologous to the baculovirus-specific fragment in the transfer vector (this allows for the 
homologous recombination of the heterologous gene in to the baculovirus genome); and 
appropriate insect host cells and growth media. 

After inserting the DNA sequence encoding the protein into the transfer vector, the 
vector and the wild type viral genome are transfected into an insect host cell where the vector 
and viral genome are allowed to recombine. The packaged recombinant virus is expressed 
and recombinant plaques are identified and purified. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form fi-om, inter 
alia, Invitrogen, San Diego CA ("MaxBac" kit). These techniques are generally known to 
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those skilled in the art and fully described in Summers and Smith, Texas Agricultural 
Experiment Station Bulletin No, 1555 (1987) (hereinafter "Summers and Smith"). 

Prior to inserting die DNA sequence encoding the protein into die baculovirus 
genome, the above described components, comprising a promoter, leader (if desired), coding 
sequence of interest, and transcription termination sequence, are usually assembled into an 
intermediate transplacement construct (transfer vector). This construct may contain a single 
gene and operably linked regulatory elements; multiple genes, each with its owned set of 
operably linked regulatory elements; or multiple genes, regulated by the same set of 
regulatory elements. Intermediate transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance 
in a host, such as a bacterium. The replicon will have a replication system, thus allowing it to 
be maintained in a suitable host for cloning and amplification. 

Currently, the most commonly used transfer vector for introducing foreign genes into 
AcNPV is pAc373. Many other vectors, known to those of skill in the art, have also been 
designed. These include, for example, pVL985 (which alters the polyhedrin start codon from 
ATG to ATT, and which mtroduces a BamHI cloning site 32 basepairs downstream from the 
ATT; see Luckow and Summers, Virology (1989) 77:31. 

The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et al. 
(1988) Ann, Rev, Microbiol,, 42:111) and a prokaryotic ampicillin-resistance {amp) gene and 
origin of replication for selection and propagation in E, coli, 

Baculovirus transfer vectors usually contain a baculovirus promoter. A baculovirus 
promoter is any DNA sequence capable of binding a baculovirus RNA polymerase and 
initiating the downstream (5' to 3*) transcription of a coding sequence (e.g., structural gene) 
into mRNA. A promoter will have a transcription initiation region which is usually placed 
proximal to the 5' end of the coding sequence. This transcription initiation region usually 
includes an RNA polymerase binding site and a transcription initiation site. A baculovirus 
transfer vector may also have a second domain called an enhancer, which, if present, is 
usually distal to the structural gene. Expression may be either regulated or constitutive. 

Structural genes, abundantly transcribed at late times in a viral infection cycle, provide 
particularly usefiil promoter sequences. Examples include sequences derived from the gene 
encoding the viral polyhedron protein, Friesen et al., (1986) "The Regulation of Baculovirus 
Gene Expression," in: The Molecular Biology of Baculovirus es (ed. Walter Doerfler); EPO 
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Publ. Nos. 127 839 and 155 476; and the gene encoding the p 10 protein, Vlak et al., (1988), 1 
Gen, Virol 69:165. 

DNA encoding suitable signal sequences can be derived from genes for secreted insect 
or baculovirus proteins, such as the baculovirus polyhedrin gene (Carbonell et al. (1988) 
Gene, 75:409). Alternatively, since the signals for manunahan cell posttranslational 
modifications (such as signal peptide cleavage, proteolytic cleavage, and phosphorylation) 
appear to be recognized by insect cells, and the signals required for secretion and nuclear 
accumulation also appear to be conserved between the invertebrate cells and vertebrate cells, 
leaders of non-insect origm, such as those derived from genes encoding human (alpha) a- 
interferon, Maeda et al., (1985), Nature 5/5:592; human gastrin-releasing peptide, Lebacq- 
Verheyden et al., (1988), Molec. Cell Biol 5:3129; human IL-2, Smith et al., (1985) Proc. 
NaVl Acad, ScL USA, 52:8404; mouse IL-3, (Miyajima et al., (1987) Gene 58:273; and human 
glucocerebrosidase, Martin et al. (1988) DNA, 7:99, can also be used to provide for secretion 
in insects. 

A recombinant polypeptide or polyprotein may be expressed intracellularly or, if it is 
expressed with the proper regulatory sequences, it can be secreted. Good intracellular 
expression of nonfiised foreign proteins usually requires heterologous genes that ideally have 
a short leader sequence containing suitable translation initiation signals preceding an ATG 
start signal. If desired, metiiionine at the N-terminus may be cleaved from the mature protein 
by in vitro incubation with cyanogen bromide. 

Alternatively, recombinant polyproteins or proteins which are not naturally secreted 
can be secreted from the insect cell by creating chimeric DNA molecules that encode a ftision 
protein comprised of a leader sequence fragment that provides for secretion of the foreign 
protein in insects. The leader sequence fragment usually encodes a signal peptide comprised 
of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic 
reticulum. 

After insertion of the DNA sequence and/or the gene encoding the expression product 
precursor of the protein, an insect cell host is co-transformed with the heterologous DNA of 
the transfer vector and the genomic DNA of wild type baculovirus - usually by co- 
transfection. The promoter and transcription termination sequence of the construct will 
usually comprise a 2-5kb section of the baculovirus genome. Methods for introducing 
heterologous DNA into the desired site in the baculovirus virus are known in the art. (See 
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Summers and Smiihsupra; Ju et al, (1987); Smith et al., Mol. Cell Biol (1983) 5:2156; and 
Luckow and Summers (1989)). For example, the insertion can be into a gene such as the 
polyhedrin gene, by homologous double crossover recombination; insertion can also be into a 
restriction enzyme site engineered into the desired baculovirus gene. Miller et al, (1989), 
Bioessays 4:9\. The DNA sequence, when cloned in place of the polyhedrin gene in the 
expression vector, is flanked both 5' and 3' by polyhedrin-specific sequences and is positioned 
downstream of the polyhedrin promoter. 

The newly formed baculovirus expression vector is subsequently packaged into an 
infectious recombinant baculovirus. Homologous recombination occurs at low frequency 
(between about 1% and about 5%); thus, the majority of the virus produced after 
cotransfection is still wild-type virus. Therefore, a method is necessary to identify 
recombinant viruses. An advantage of the expression system is a visual screen allowing 
recombinant viruses to be distinguished. The polyhedrin protein, which is produced by the 
native virus, is produced at very high levels in the nuclei of infected cells at late times after 
viral infection. Accumulated polyhedrin protein forms occlusion bodies that also contain 
embedded particles. These occlusion bodies, up to 15 pm in size, are highly refractile, giving 
them a bright shiny appearance that is readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies. To distinguish recombinant virus 
from wild-type virus, the transfection supernatant is plaqued onto a monolayer of insect cells 
by techniques known to those skilled in the art. Namely, the plaques are screened under the 
light microscope for the presence (indicative of wild-type virus) or absence (indicative of 
recombinant virus) of occlusion bodies. Current Protocols in Microbiology Vol. 2 (Ausubel 
et al. eds) at 16.8 (Supp. 10, 1990); Sununers and Smith, supra\ Miller et al. (1989). 

Recombinant baculovirus expression vectors have been developed for infection into 
several insect cells. For example, recombinant baculoviruses have been developed for, inter 
alia: Aedes aegypti , Autographa califomica, Bombyx mori, Drosophila melanogaster, 
Spodopterafrugiperda, and Trichoplusia ni (PCT Pub. No. WO 89/046699; Carbonell et al., 
(1985) J. Virol 55:153; Wright (1986) A^a/wre 527:718; Smith etal., (1983) Mo/. Cell Biol 
5:2156; and see generally, Fraser, etal (1989) /w Vitro Cell Dev. Biol 25:225). 

Cells and cell culture media are commercially available for both direct and fusion 
expression of heterologous polypeptides in a baculovirus/expression system; cell culture 
technology is generally known to those skilled in the art. See, e.g., Summers and Smith supra. 
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The modified insect cells may then be grown in an appropriate nutrient medium, 
which allows for stable maintenance of the plasmid(s) present in the modified insect host. 
Where the expression product gene is under inducible control, the host may be grown to high 
density, and expression induced. Alternatively, where expression is constitutive, the product 
will be continuously expressed into the medium and the nutrient medium must be 
continuously circulated, while removing the product of interest and augmenting depleted 
nutrients. The product may be purified by such techniques as chromatography, e.g., HPLC, 
affinity chromatography, ion exchange chromatography, etc.; electrophoresis; density gradient 
centrifugation; solvent extraction, or the like. As appropriate, the product may be further 
purified, as required, so as to remove substantially any insect proteins which are also secreted 
in the medium or resuU fiom lysis of insect cells, so as to provide a product which is at least 
substantially free of host debris, e.g., proteins, lipids and polysaccharides. 

In order to obtain protein expression, recombinant host cells derived from the 
transformants are incubated under conditions which allow expression of the recombinant 
protein encoding sequence. These conditions will vary, dependent upon the host cell selected. 
However, the conditions are readily ascertainable to those of ordinary skill in the art, based 
upon what is known in the art. 

iv. Bacterial Systems 

Bacterial expression techniques are known in the art. A bacterial promoter is any DNA 
sequence capable of binding bacterial RNA polymerase and initiating the downstream (3') 
transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a 
transcription initiation region which is usually placed proximal to the 5' end of the coding 
sequence. This transcription initiation region usually includes an RNA polymerase binding 
site and a transcription initiation site. A bacterial promoter may also have a second domain 
called an operator, that may overiap an adjacent RNA polymerase binding site at which RNA 
synthesis begins. The operator permits negative regulated (inducible) transcription, as a gene 
repressor protein may bind the operator and thereby inhibit transcription of a specific gene. 
Constitutive expression may occur in the absence of negative regulatory elements, such as the 
operator. In addition, positive regulation may be achieved by a gene activator protein binding 
sequence, which, if present is usually proximal (5') to the RNA polymerase binding sequence. 
An example of a gene activator protein is the catabolite activator protein (CAP), which helps 
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initiate transcription of the lac operon in Escherichia coli (E. coli) (Raibaud et al (1984) 
Annu. Rev. Genet J 8:173). Regulated expression may therefore be either positive or negative, 
thereby either enhancing or reducing transcription. 

Sequences encoding metabolic pathway enzymes provide particularly useful promoter 
sequences. Examples include promoter sequences derived from sugar metabolizing enzymes, 
such as galactose, lactose (lac) (Chang et al. (1977) Nature 7PS:1056), and maltose. 
Additional examples include promoter sequences derived from biosynthetic enzymes such as 
tryptophan (trp) (Goeddel et aL (1980) Nuc. Acids Res. 5:4057; Yelverton et al. (1981) Nucl. 
Acids Res. P:731; U.S. Patent 4,738,921; EPO Publ. Nos. 036 776 and 121 775). The beta- 
lactamase (bla) promoter system (Weissmann (1981) "The cloning of interferon and other 
mistakes." In Interferon 3 (ed. I. Gresser)), bacteriophage lambda PL (Shimatake et al. (1981) 
Nature 2P2: 128) and T5 (U.S. Patent 4,689,406) promoter systems also provide useful 
promoter sequences. 

In addition, synthetic promoters which do not occur in nature also function as bacterial 
promoters. For example, transcription activation sequences of one bacterial or bacteriophage 
promoter may be joined with the operon sequences of another bacterial or bacteriophage 
promoter, creating a synthetic hybrid promoter (U.S. Patent 4,551,433). For example, the tac 
promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac operon 
sequences that is regulated by the lac repressor (Amann et al. (1983) Gene 25:167; de Boer et 
al. (1983) Proc. Natl. Acad. Sci. 80:21). Furthermore, a bacterial promoter can include 
naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial 
RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial 
origin can also be coupled with a compatible RNA polymerase to produce high levels of 
expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter 
system is an example of a coupled promoter system (Studier et aL (1986) J. Mol Biol. 
189:113; Tabor et al. (1985) Proc Natl. Acad Sci. 82:1074). In addition, a hybrid promoter 
can also be comprised of a bacteriophage promoter and an E. coli opemtor region (EPO Publ. 
No. 267 851). 

In addition to a functioning promoter sequence, an efficient ribosome binding site is 
also useful for the expression of foreign genes in prokaryotes. InE. coli, the ribosome binding 
site is called the Shine-Dalgamo (SD) sequence and includes an initiation codon (ATG) and a 
sequence 3-9 nucleotides in length located 3-1 1 nucleotides upstream of the initiation codon 
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(Shine et al (1975) Nature 254:34). The SD sequence is thought to promote binding of 
mRNA to the ribosome by the pairing of bases between the SD sequence and the 3' end of 
E, coli 16S rRNA (Steitz et al (1979) "Genetic signals and nucleotide sequences in 
messenger RNA." \n Biological Regulation and Development: Gene Expression (ed. R.F. 
Goldberger)). To express eukaryotic genes and prokaryotic genes with weak ribosome- 
binding site, it is often necessary to optimize the distance between the SD sequence and the 
ATG of the eukaryotic gene (Sambrook et al (1989) "Expression of cloned genes in 
Escherichia coli." In Molecular Cloning: A Laboratory Manual), 

A DNA molecule may be expressed intracellularly. A promoter sequence may be 
directly linked with the DNA molecule, in >yhich case the first amino acid at the N-terminus 
will always be a methionine, which is encoded by the ATG start codon. If desired, methionine 
at the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen 
bromide or by either in vivo or in vitro incubation with a bacterial methionine N-terminal 
peptidase (EPO Publ. No. 2 19 237). 

Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous bacterial protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fiision of the two amino acid sequences. For example, the bacteriophage lambda cell 
gene can be linked at the 5' terminus of a foreign gene and expressed in bacteria. The resulting 
fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the 
bacteriophage protein from the foreign gene (Nagai et al (1984) Nature 309:810), Fusion 
proteins can also be made with sequences from the lacZ (Jia et al (1987) Gene 50:197), trpE 
(Allen et al (1987) J. BiotechnoL 5:93; Makoff al (1989) 7. Gen, Microbiol 755:1 1), and 
Chey (EPO Publ No. 324 647) genes. The DNA sequence at the junction of the two amino 
acid sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion 
protein. Such a fusion protein is made with the ubiquitin region that preferably retains a site 
for a processing en2yme (e.g. ubiquitin specific processing-protease) to cleave the ubiquitin 
from the foreign protein. Through this method, native foreign protein can be isolated (Miller 
etal (1989) Bio/Technology 7:698). 

Alternatively, foreign proteins can also be secreted from the cell by creating chimeric 
DNA molecules that encode a fusion protein comprised of a signal peptide sequence fragment 
that provides for secretion of the foreign protein in bacteria (U.S. Patent 4,336,336). The 
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signal sequence fragment usually encodes a signal peptide comprised of hydrophobic amino 
acids which direct the secretion of the protein from the cell. The protein is either secreted into 
the growth media (gram-positive bacteria) or into the periplasmic space, located between the 
inner and outer membrane of the cell (gram-negative bacteria). Preferably there are processing 
sites, which can be cleaved either in vivo or in vitro encoded between the signal peptide 
fragment and the foreign gene, 

DNA encoding suitable signal sequences can be derived from genes for secreted 
bacterial proteins, such as the E. coli outer membrane protein gene (ompA) (Masui et al 
(1983), in: Experimental Manipulation of Gene Expression; Ghrayeb et al, (1984) EMBO J. 
J:2437) and the E. coli alkaline phosphatase signal sequence (phoA) (Oka et al. (1985) Proa 
Natl. Acad. Sci. 52:7212). As an additional example, the signal sequence of the alpha-amylase 
gene from various Bacillus strains can be used to secrete heterologous proteins from B. 
subtilis (Palva et al (1982) Proc. Natl Acad. Set USA 7P:5582; EPO Pubh No. 244 042). 

Usually, transcription termination sequences recognized by bacteria are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Transcription termination sequences 
frequently include DNA sequences of about 50 nucleotides capable of forming stem loop 
structures that aid in terminating transcription. Examples include transcription termination 
sequences derived from genes with strong promoters, such as the trp gene in E. coli as well as 
other biosynthetic genes. 

Usually, the above described components, comprising a promoter, signal sequence (if 
desired), coding sequence of interest, and transcription termination sequence, are put together 
into expression constructs. Expression constructs are often maintained in a replicon, such as 
an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
bacteria. The replicon will have a replication system, thus allowing it to be maintained in a 
prokaryotic host either for expression or for cloning and amplification. In addition, a replicon 
may be either a high or low copy number plasmid, A high copy number plasmid will 
generally have a copy number ranging from about 5 to about 200, and usually about 10 to 
about 150. A host containing a high copy number plasmid will preferably contain at least 
about 10, and more preferably at least about 20 plasmids. Either a high or low copy number 
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vector may be selected, depending upon the effect of the vector and the foreign protein on the 
host. 

Alternatively, the expression constructs can be integrated into the bacterial genome 
with an integrating vector. Integrating vectors usually contain at least one sequence 
homologous to the bacterial chromosome that allows the vector to integrate. Integrations 
appear to result from recombinations between homologous DNA in the vector and the 
bacterial chromosome. For example, integrating vectors constructed with DNA from various 
Bacillus strains integrate into the Bacillus chromosome (EPO Publ. No. 127 328). Integrating 
vectors may also be comprised of bacteriophage or transposon sequences. 

Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of bacterial strains diat have been transformed. 
Selectable markers can be expressed in the bacterial host and may include genes which render 
bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin 
(neomycin), and tetracycline (Davies et al (1978) Annu. Rev. Microbiol 52:469). Selectable 
markers may also include biosynthetic genes, such as those in the histidine, tryptophan, and 
leucine biosynthetic pathways. 

Alternatively, some of the above described components can be put together in 
transformation vectors. Transformation vectors are usually comprised of a selectable market 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extra-chromosomal replicons or 
integrating vectors, have been developed for transformation into many bacteria. For example, 
expression vectors have been developed for, inter alia, the following bacteria: Bacillus 
subtilis (Palva et al (1982) Proc, Natl Acad. Scl USA 7P:5582; EPO Publ. Nos. 036 259 and 
063 953; PCT Publ. No. WO 84/04541), Escherichia coli (Shimatake et al (1981) Nature 
292:128; Amann et al (1985) Gene 40:183; Studier et al (1986) 7. Mol Biol 189:113; EPO 
Publ. Nos. 036 776, 136 829 and 136 907), Streptococcus cremoris (Powell et al (1988) Appl 
Environ. Microbiol 5^:655); Streptococcus lividans (Powell et al (1988) Appl Environ. 
Microbiol 54:655), Streptomyces lividans (U.S. Patent 4,745,056). 

Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, 
and usually include either the transformation of bacteria treated with CaCl2 or other agents, 
such as divalent cations and DMSO. DNA can also be introduced into bacterial cells by 
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electroporation. Transformation procedures usually vary with the bacterial species to be 
transformed. (See e.g., use of Bacillus: Masson et al (1989) FEMS Microbiol Lett. 60:273; 
Palva et al (1982) Proc, Natl Acad. ScL USA 7P:5582; EPO Publ. Nos. 036 259 and 063 953; 
PCT Publ. No. WO 84/04541; use of Campylobacter: Miller et al, (1988) Proa Natl Acad. 
ScL 55:856; and Wang et al (1990) J. Bacteriol 772:949; use of Escherichia coli: Cohen et 
al (1973) Proc. Natl Acad ScL 69:21 10; Dower et al (1988) Nucleic Acids Res. 76:6127; 
Kushner (1978) "An improved method for transformation of Escherichia coli with ColEl- 
derived plasmids. In Genetic Engineering: Proceedings of the International Symposium on 
Genetic Engineering (eds. H.W. Beyer and S. Nicosia); Mandel et al (1970) J. Mol Biol 
55:159; Taketo (1988) Biochim. Biophys. Acta 949:318; use of Lactobacillus: Chassy et al 
(1987) FEMS Microbiol Lett 44:113; use of Pseudomonas: Fiedler al (1988) Anal 
Biochem 170:3%; use of Staphylococcus: Angustin et al (1990) FEMS Microbiol Lett. 
66:203; use of Streptococcus: Barany etal (1980)7. Bacteriol 144:698; Harlander (1987) 
"Transformation of Streptococcus lactis by electroporation, in: Streptococcal Genetics (ed. J. 
Ferretti and R. Curtiss HI); Perry et al (1981) Infect Immun. 32:1295; Powell et al (1988) 
Appl Environ. Microbiol 54:655; Somkuti et al. (1987) Proa 4th Evr, Cong. Biotechnology 
7:412. 

V. Yeast Expression 

Yeast expression systems are also known to one of ordinary skill in the art. A yeast 
promoter is any DNA sequence capable of binding yeast RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region which is usually placed proximal to the 5' 
end of the coding sequence. This transcription initiation region usually includes an RNA 
polymerase binding site (the "TATA Box") and a transcription initiation site. A yeast 
promoter may also have a second domain called an upstream activator sequence (UAS), 
which, if present, is usually distal to the structural gene. The UAS permits regulated 
(inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated 
expression may be either positive or negative, thereby either enhancing or reducing 
transcription. 

Yeast is a fermenting organism with an active metabolic pathway, therefore sequences 
encoding en2ymes in the metabolic pathway provide particularly useful promoter sequences. 
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Examples include alcohol dehydrogenase (ADH) (EPO Publ. No. 284 044), enolase, 
glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphate-dehydrogenase 
(GAP or GAPDH), hexokinase, phosphofhictokinase, 3-phosphoglycerate mutase, and 
pyruvate kinase (PyK) (EPO Publ. No. 329 203). The yeast PH05 gene, encoding acid 
phosphatase, also provides useful promoter sequences (Myanohara et al (1983) Proc. Natl 
Aca±ScL USA80:iy 

In addition, synthetic promoters which do not occur in nature also function as yeast 
promoters. For example, UAS sequences of one yeast promoter may be joined with the 
transcription activation region of another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked to 
the GAP transcription activation region (U.S. Patent Nos. 4,876,197 and 4,880,734). Other 
examples of hybrid promoters include promoters which consist of the regulatory sequences of 
either the ADH2, GAL4, GALJO, OR PH05 genes, combined with the transcriptional 
activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 164 556). 
Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin 
that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of 
such promoters include, inter alia, (Cohen et al (1980) Proc. Natl Acad, Set USA 77: 1078; 
Henikoff et al (1981) Nature 283:835\ Hollenberg et al (1981) Curr. Topics Microbiol 
Immunol 96:119; Hollenberg et al (1979) *The Expression of Bacterial Antibiotic Resistance 
Genes in the Yeast Saccharomyces cerevisiae," in: Plasmids of Medical, Environmental and 
Commercial Importance (eds. K.N. Timmis and A. Puhler); Mercerau-Puigalon et al (1980) 
Gene 11:163; Panthier et al (1980) Curr Genet. 2:109;). 

A DNA molecule may be expressed intracellularly in yeast. A promoter sequence may 
be directly linked with the DNA molecule, in which case the first amino acid at the N- 
terminus of the recombinant protein will always be a methionine, which is encoded by the 
ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by 
in vitro incubation with cyanogen bromide. 

Fusion proteins provide an alternative for yeast expression systems, as well as in 
mammalian, plant, baculovirus, and bacterial expression systems. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous yeast protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the yeast or human 
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superoxide dismutase (SOD) gene, can be linked at the 5' terminus of a foreign gene and 
expressed in yeast. The DNA sequence at the junction of the two amino acid sequences may 
or may not encode a cleavable site. See e.g., EPO Publ. No. 196056. Another example is a 
ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that 
preferably retains a site for a processing enzyme (e.g. ubiquitin-specific processing protease) 
to cleave the ubiquitin from the foreign protein. Through this method, therefore, native 
foreign protein can be isolated (e.g., W08 8/024066). 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provide for secretion in yeast of the foreign protein. Preferably, there 
are processing sites encoded between the leader fragment and the foreign gene that can be 
cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct the secretion of the protein from 
the cell. 

DNA encoding suitable signal sequences can be derived from genes for secreted yeast 
proteins, such as the yeast invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 
62:096,086) and the A-factor gene (U.S. Patent 4,588,684). Alternatively, leaders of non- 
yeast origin, such as an interferon leader, exist that also provide for secretion in yeast (EPO 
Publ. No. 060 057). 

A preferred class of secretion leaders are those that employ a fragment of the yeast 
alpha-factor gene, which contains both a "pre" signal sequence, and a "pro" region. The types 
of alpha-factor fragments that can be employed include the full-length pre-pro alpha factor 
leader (about 83 amino acid residues) as well as truncated alpha-factor leaders (usually about 
25 to about 50 amino acid residues) (U.S. Patent Nos. 4,546,083 and 4,870,008; EPO Publ. 
No. 324 274). Additional leaders employing an alpha-factor leader fragment that provides for 
secretion include hybrid alpha-factor leaders made with a presequence of a first yeast, but a 
pro-region from a second yeast alphafactor. (See e.g., PCT Publ. No. WO 89/02463.) 

Usually, transcription termination sequences recognized by yeast are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Examples of transcription terminator 
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sequence and other yeast-recognized termination sequences, such as those coding for 
glycolytic enzymes. 

Usually, the above described components, comprising a promoter, leader (if desired), 
coding sequence of interest, and transcription termination sequence, are put together into 
expression constructs. Expression constructs are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
yeast or bacteria. The replicon may have two replication systems, thus allowing it to be 
maintained, for example, in yeast for expression and in a prokaiyotic host for cloning and 
amplification. Examples of such yeast-bacteria shuttle vectors include YEp24 (Botstein et al 
(1979) Gene 5:17-24), pCl/1 (Brake et al (1984) Proc, Natl Acad. Sci USA 57:4642-4646), 
and YRpl7 (Stinchcomb et al (1982) /. Mol Biol 158:157), In addition, a replicon may be 
either a high or low copy number plasmid. A high copy number plasmid will generally have a 
copy number ranging from about 5 to about 200, and usually about 10 to about 150. A host 
containing a high copy number plasmid will preferably have at least about 10, and more 
preferably at least about 20. Enter a high or low copy number vector may be selected, 
depending upon the effect of the vector and the foreign protein on the host. See e.g., Brake et 
alj supra. 

Alternatively, the expression constructs can be integrated into the yeast genome with 
an integrating vector. Integrating vectors usually contain at least one sequence homologous to 
a yeast chromosome that allows the vector to integrate, and preferably contain two 
homologous sequences flanking the expression construct. Integrations appear to resuU from 
recombinations between homologous DNA in the vector and the yeast chromosome (Orr- 
Weaver et al (1983) Methods in Enzymol 707:228-245). An integrating vector may be 
directed to a specific locus in yeast by selecting the appropriate homologous sequence for 
inclusion in the vector. See Orr- Weaver et al, supra. One or more expression construct may 
integrate, possibly affecting levels of recombinant protein produced (Rine et al (1983) Proa 
Natl Acad, Set USA 50:6750). The chromosomal sequences included in the vector can occur 
either as a single segment in the vector, which results in the integration of the entire vector, or 
two segments homologous to adjacent segments in the chromosome and flanking the 
expression construct in the vector, which can result in the stable integration of only the 
expression construct. 
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Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of yeast strains that have been transformed. 
Selectable markers may include biosynthetic genes that can be expressed in the yeast host, 
such as ADE2, HIS4, LEU2, TRPl, and^GZ, and the G418 resistance gene, which confer 
resistance in yeast cells to tunicamycin and G418, respectively. In addition,, a suitable 
selectable marker may also provide yeast with the ability to grow in the presence of toxic 
compounds, such as metal. For example, the presence of CUPl allows yeast to grow in the 
presence of copper ions (Butt et al {19^1) Microbiol Rev, J7:351). 

Alternatively, some of the above described components can be put together into 
transformation vectors. Transformation vectors are usually comprised of a selectable marker 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extrachromosomal replicons or 
integrating vectors, have been developed for transformation into many yeasts. For example, 
expression vectors and methods of introducing exogenous DNA into yeast hosts have been 
developed for, inter alia, the following yeasts: Candida albicans (Kurtz, et al (1986) Mol 
Cell Biol 5:142); Candida maltosa (Kunze, etal (1985) J. Basic Microbiol 25:141); 
Hansenula polymorpha (Gleeson, etal (1986)7. Gen. Microbiol 132:3459; Roggenkamp et 
al (1986) Mol Gen, Genet, 202:302); Kluyveromyces fragilis (Das, et al (1984) J. Bacteriol 
755:1 165); Kluyveromyces lactis (De Louvencourt et al (1983) J. Bacteriol 154:737; Van 
den Berg et al (1990) Bio/Technology <S:135); Pichia guillenmondii (Kunze et al (1985) J, 
Basic Microbiol 25:141); Pichia pastoris (Gregg, et al (1985) Mol Cell Biol 5:3376; U.S. 
Patent Nos. 4,837,148 and 4,929,555); Saccharomyces cerevisiae (Hinnen et al (1978) Proc, 
Natl Acad, Scl USA 75:1929; Ito et al (1983)7. Bacteriol 755:163); Schizosaccharomyces 
pombe (Beach and Nurse (1981) Nature 500:706); and Yarrowia lipolytica (Davidow, et al 
(1985) Curr, Genet, 70:380471 Gaillardin, et al (1985) Curr. Genet 70:49). 

Methods of introducing exogenous DNA into yeast hosts are well-known in the art, 
and usually include either the transformation of spheroplasts or of intact yeast cells treated 
with alkali cations. Transformation procedures usually vary with the yeast species to be 
transformed. See e.g., [Kurtz et al (1986) Mol Cell Biol 6:142; Kunze et al (1985) J. Basic 
Microbiol 25:141; Candida]; [Gleeson et al (1986) 7. Gen, Microbiol 752:3459; 
Roggenkamp et al, (1986) Mol Gen, Genet, 202:302; Hansenula]; [Das et al (1984) 7 
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Bacteriol 755:1165; De Louvencourte/ a/. (1983)7. Bacteriol 75^:1165; Van den Berg a/. 
{\990) Bio/Technology 5:135; Kluyveromyces]; [Cregg etal (l9B5)MoL Cell, Biol 5:3376; 
Kunze et al (1985) J. Basic Microbiol 25:141 ; U.S. Patent Nos. 4,837,148 and 4,929,555; 
Pichia]; [Hinnen et al (1978) Proc, Natl Acad, ScL USA 75;1929; Ito et al (1983) J. 
Bacteriol 153:163 Saccharomyces]; [Beach and Nurse (1981) Nature 300:706; 
Schizosaccharomyces]; [Davidow et al (1985) Curr. Genet. 70:39; Gaillardin et al (1985) 
Curr, Genet, 70:49; Yanowia]. 

Definitions 

A composition containing X is "substantially free of Y when at least 85% by weight 
of the total X+Y in the composition is X. Preferably, X comprises at least about 90% by 
weight of the total of X+Y in the composition, more preferably at least about 95% or even 
99% by weight. 

A "conserved" Neisseria amino acid fi-agment or protein is one that is present in a 
particular Neisserial protein in at least x% of Neisseria, The value of x may be 50% or more, 
e.g., 66%, 75%), 80%, 90%, 95% or even 100% (i.e. the amino acid is found in the protein in 
question in all Neisseria), In order to determine whether an animo acid is "conserved" in a 
particular Neisserial protein, it is necessary to compare that amino acid residue in the 
sequences of the protein in question fi-om a plurality of different Neisseria (a reference 
population). The reference population may include a number of different Neisseria species 
or may include a single species. The reference population may include a number of different 
serogroups of a particular species or a single serogroup. A preferred reference population 
consists of the 5 most common Neisseria strains. 

The term "heterologous" refers to two biological components that are not found 
together in nature. The components may be host cells, genes, or regulatory regions, such as 
promoters. Although the heterologous components are not found together in nature, they can 
function together, as when a promoter heterologous to a gene is operably Unked to the gene. 
Another example is where a Neisserial sequence is heterologous to a mouse host cell. 

"Epitope" means antigenic determinant, and may elicit a cellular and/or humoral 
response. 

Conditions for "high stringency" are 65 degrees C in 0.1 xSSC 0.5% SDS solution. 
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An "origin of replication" is a polynucleotide sequence that initiates and regulates 
replication of polynucleotides, such as an expression vector. The origin of replication behaves 
as an autonomous unit of polynucleotide replication within a cell, capable of replication under 
its own control. An origin of replication may be needed for a vector to rephcate in a particular 
host cell. With certain origins of replication, an expression vector can be reproduced at a high 
copy niunber in the presence of the appropriate proteins within the cell. Examples of origins 
are the autonomously replicating sequences, which are effective in yeast; and the viral 
T-antigen, effective in COS-7 cells. 

A "mutant" sequence is defined as a DNA, RNA or amino acid sequence differing 
from but having homology with the native or disclosed sequence. Depending on the 
particular sequence, the degree of homology (sequence identity) between the native or 
disclosed sequence and the mutant sequence is preferably greater than 50% (e.g., 60%, 70%, 
80%, 90%, 95%, 99% or more) which is calculated as described above. As used herein, an 
"allelic variant" of a nucleic acid molecule, or region, for which nucleic acid sequence is 
provided herein is a nucleic acid molecule, or region, that occurs at essentially the same locus 
in the genome of another or second isolate, and that, due to natural variation caused by, for 
example, mutation or recombination, has a similar but not identical nucleic acid sequence. A 
coding region allelic variant typically encodes a protein having similar activity to that of the 
protein encoded by the gene to which it is being compared. An allelic variant can also 
comprise an alteration in the 5' or 3* untranslated regions of the gene, such as in regulatory 
control regions, (see, for example, U.S. Patent 5,753,235). 

Antibodies 

As used herem, the term "antibody" refers to a polypeptide or group of polypeptides 
composed of at least one antibody combining site. An "antibody combining site" is the 
three-dimensional binding space with an internal surface shape and charge distribution 
complementary to the features of an epitope of an antigen, which allows a binding of the 
antibody with the antigen. "Antibody" includes, for example, vertebrate antibodies, hybrid 
antibodies, chimeric antibodies, humanized antibodies, altered antibodies, univalent 
antibodies. Fab proteins, and single domain antibodies. 

Antibodies against the proteins of the invention are useful for affinity 
chromatography, immunoassays, and distinguishing/identifying Neisseria menB proteins. 



wo 99/57280 




PCT/US99/09346 



31 



Antibodies elicited against the proteins of the present invention bind to antigenic polypeptides 
or proteins or protein fragments that are present and specifically associated with strains of 
Neisseria meningitidis menB. In some instances, these antigens may be associated with 
specific strains, such as those antigens specific for the menB strains. The antibodies of the 
invention may be immobilized to a matrix and utilized in an immunoassay or on an affinity 
chromatography column, to enable the detection and/or separation of polypeptides, proteins or 
protein Augments or cells comprising such polypeptides, proteins or protein fragments. 
Alternatively, such polypeptides, proteins or protein Augments may be immobilized so as to 
detect antibodies bindably specific thereto. 

Antibodies to the proteins of the invention, both polyclonal and monoclonal, may be 
prepared by conventional methods. In general, the protein is first used to immunize a suitable 
animal, preferably a mouse, rat, rabbit or goat. Rabbits and goats are preferred for the 
preparation of polyclonal sera due to the volume of serum obtainable, and the availability of 
labeled anti-rabbit and anti-goat antibodies. Immunization is generally performed by mixing 
or emulsifying the protein in saline, preferably in an adjuvant such as Freund's complete 
adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or 
intramuscularly). A dose of 50-200 ^g/injection is typically sufficient. Immunization is 
generally boosted 2-6 weeks later with one or more injections of the protein in saline, 
preferably using Freund's incomplete adjuvant. One may altematively generate antibodies by 
in vitro immunization using methods known in the art, which for the purposes of this 
invention is considered equivalent to in vivo immunization. Polyclonal antisera is obtained by 
bleeding the immunized animal into a glass or plastic container, incubating the blood at 25**C 
for one hour, followed by incubating at A^'C for 2-18 hours. The serum is recovered by 
centrifiigation (e.g., l.QQOg for 10 minutes). About 20-50 ml per bleed may be obtained from 
rabbits. 

Monoclonal antibodies are prepared using the standard method of Kohler & Milstein 
{Nature (1975) 256:495-96), or a modification thereof. Typically, a mouse or rat is 
immunized as described above. However, rather than bleeding the animal to extract serum, 
the spleen (and optionally several large lymph nodes) is removed and dissociated into single 
cells. If desired, the spleen cells may be screened (after removal of nonspecifically adherent 
cells) by applying a cell suspension to a plate or well coated with the protein antigen. B-cells 
that express membrane-bound immunoglobulin specific for the antigen bind to the plate, and 
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are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen 
cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, "HAT"). The 
resulting hybridomas are plated by limiting dilution, and are assayed for the production of 
antibodies which bind specifically to the immunizing antigen (and which do not bind to 
unrelated antigens). The selected MAb-secreting hybridomas are then cultured either in vitro 
(e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (as ascites in mice). 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive 
atoms (particularly ^^P and *^^I), electron-dense reagents, enzymes, and ligands having 
specific binding partners. Enzymes are typically detected by their activity. For example, 
horseradish peroxidase is usually detected by its ability to convert 
3,3*,5,5'-tetramethylbenzidine (TMB) to a blue pigment, quantifiable with a 
spectrophotometer. "Specific binding partner" refers to a protein capable of binding a ligand 
molecule with high specificity, as for example in the case of an antigen and a monoclonal 
antibody specific therefor. Other specific binding partners include biotin and avidin or 
streptavidin, IgG and protein A, and the niunerous receptor-Iigand couples known in the art. It 
should be understood that the above description is not meant to categorize the various labels 
into distinct classes, as the same label may serve in several different modes. For example, ^^^I 
may serve as a radioactive label or as an electron-dense reagent. HRP may serve as enzyme or 
as antigen for a MAb. Further, one may combine various labels for desired effect. For 
example, MAbs and avidin also require labels in the practice of this invention: thus, one 
might label a MAb with biotin, and detect its presence with avidin labeled with *^^I, or with 
an anti-biotin MAb labeled with HRP. Other permutations and possibiUties will be readily 
apparent to those of ordinary skill in the art, and are considered as equivalents within the 
scope of the instant invention. 

Antigens, immunogens, polypeptides, proteins or protein firagments of the present 
invention elicit formation of specific binding partner antibodies. These antigens, 
immunogens, polypeptides, proteins or protein Augments of the present invention comprise 
immunogenic compositions of the present invention. Such immunogenic compositions may 
further comprise or include adjuvants, carriers, or other compositions that promote or enhance 
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or stabilize the antigens, polypeptides, proteins or protein fragments of the present invention. 
Such adjuvants and carriers will be readily apparent to those of ordinary skill in the art. 

Pharmaceutical Compositions 

Pharmaceutical compositions can comprise (include) either polypeptides, antibodies, 
or nucleic acid of the invention. The pharmaceutical compositions will comprise a 
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 
claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 
detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature, when given to a patient that is febrile. The 
precise effective amount for a subject will depend upon the subject's size and health, the 
nature and extent of the condition, and the therapeutics or combination of therapeutics 
selected for administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation can be determined by routine 
experimentation and is within the judgment of the clinician. 

For purposes of the present invention, an effective dose will be from about 0.01 mg/ 
kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to 
which it is administered. 

A pharmaceutical composition can also contain a pharmaceutically acceptable cairier. 
The term "pharmaceutically acceptable carrier'' refers to a carrier for administration of a 
therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 
The term refers to any pharmaceutical carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition, and which may be 
administered without undue toxicity. Suitable carriers may be large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers are 
well known to those of ordmary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 
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organic acids such as acetates, propionates, malonates, benzoates, and the Hke. A thorough 
discussion of pharmaceutically acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., NJ. 1991). 

Pharmaceutically acceptable carriers in therapeutic compositions may contain liquids 
such as water, saline, glycerol and ethanol. Additionally, auxiliary substances, such as wetting 
or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles. 
Typically, the therapeutic compositions are prepared as injectables, either as liquid solutions 
or suspensions; solid forms suitable for solution in, or suspension in, Uquid vehicles prior to 
injection may also be prepared. Liposomes are included within the definition of a 
pharmaceutically acceptable carrier. 

Delivery Methods 

Once formulated, the compositions of the invention can be administered directly to the 
subject. The subjects to be treated can be animals; in particular, human subjects can be 
treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a lesion. Other 
modes of administration include oral and pulmonary administration, suppositories, and 
transdermal and transcutaneous applications, needles, and gene guns or hyposprays. Dosage 
treatment may be a single dose schedule or a multiple dose schedule. 

Vaccines 

Vaccines according to the invention may either be prophylactic (i.e., to prevent 
infection) or therapeutic (i.e., to treat disease after infection). 

Such vaccines comprise immunizing antigen(s) or inmnmogen(s), immunogenic 
polypeptide, protein(s) or protein Jfragments, or nucleic acids (e.g., ribonucleic acid or 
deoxyribonucleic acid), usually in combination with "pharmaceutically acceptable carriers," 
which include any carrier that does not itself induce the production of antibodies harmful to 
the individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic 
acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or 
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liposomes), and inactive virus particles. Such carriers are well known to those of ordinary 
skill in the art. Additionally, these carriers may function as immunostimulating agents 
("adjuvants"). Furthermore, the immunogen or antigen may be conjugated to a bacterial 
toxoid, such as a toxoid from diphtheria, tetanus, cholera, H. pylori, etc. pathogens. 

Preferred adjuvants to enhance effectiveness of the composition include, but are not 
limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum phosphate, 
aluminum sulfate, etc; (2) oil-in-water emulsion formulations (with or without other specific 
immunostimulating agents such as muramyl peptides (see below) or bacterial cell wall 
components), such as for example (a) MF59 (PCX Publ. No. WO 90/14837), containing 5% 
Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts of 
MTP-PE (see below), although not required) formulated into submicron particles using a 
microfluidizer such as Model 1 lOY microfluidizer (Microfluidics, Newton, MA), (b) SAF, 
containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP 
(see below) either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion, and (c) Ribi™ adjuvant system (RAS), (Ribi Immunochem, Hamilton, 
MT) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell wall 
components from the group consisting of monophosphorylipid A (MPL), trehalose 
dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS (Detox™); 

(3) saponin adjuvants, such as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be 
used or particles generated therefrom such as ISCOMs (immunostimulating complexes); 

(4) Complete Freund's Adjuvant (CFA) and Incomplete Freund's Adjuvant (IF A); 

(5) cytokines, such as interleukins (e.g., IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), 
interferons (e.g., gamma interferon), macrophage colony stimulating factor (M-CSF), tumor 
necrosis factor (TNF), etc; (6) detoxified mutants of a bacterial ADP-ribosylating toxin such 
as a cholera toxin (CT), a pertussis toxin (PT), or an E. coU heat-labile toxin (LT), particularly 
LT-K63, LT-R72, CT-S109, PT.K9/G129; see, e.g., WO 93/13302 and WO 92/19265; and 
(7) other substances that act as immunostimulating agents to enhance the effectiveness of the 
composition. Alum and MF59 are preferred. 

As mentioned above, muramyl peptides include, but are not limited to, N-acetyl- 
muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D- 
isoglutamine (nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(r-2- 
dipalmitoyl-5/2-glycero-3-huydroxyphosphoryloxy)-ethylamine (MTP-PE), etc. 
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The vaccine compositions comprising immunogenic compositions (e.g., which may 
include the antigen, pharmaceutically acceptable carrier, and adjuvant) typically will contain 
diluents, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such 
as wetting or emulsifying agents, pH buffering substances, and the like, may be present in 
such vehicles. Alternatively, vaccine compositions comprising immunogenic compositions 
may comprise an antigen, polypeptide, protein, protein fragment or nucleic acid in a 
pharmaceutically acceptable carrier. 

More specifically, vaccines comprising immunogenic compositions comprise an 
immunologically effective amount of the immunogenic polypeptides, as well as any other of 
the above-mentioned components, as needed. By "immxmologically effective amoxmt", it is 
meant that the administration of that amount to an individual, either in a single dose or as part 
of a series, is effective for treatment or prevention. This amount varies depending upon the 
health and physical condition of the individual to be treated, the taxonomic group of 
individual to be treated (e.g., nonhuman primate, primate, etc,\ the capacity of the 
individual's immune system to synthesize antibodies, the degree of protection desired, the 
formulation of the vaccine, the treating doctor's assessment of the medical situation, and other 
relevant factors. It is expected that the amount will fall in a relatively broad range that can be 
determined through routine trials. 

Typically, the vaccine compositions or immunogenic compositions are prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or 
suspension in, liquid vehicles prior to injection may also be prepared. The preparation also 
may be emulsified or encapsulated in liposomes for enhanced adjuvant effect, as discussed 
above under pharmaceutically acceptable carriers. 

The immunogenic compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Additional formulations suitable for other 
modes of administration include oral and pulmonary formulations, suppositories, and 
transdermal and transcutaneous applications. Dosage treatment may be a single dose schedule 
or a multiple dose schedule. The vaccine may be administered in conjunction with other 
immunoregulatory agents. 

As an alternative to protein-based vaccines, DNA vaccination may be employed (e.g., 
Robinson & Torres (1997) Seminars in Immunology 9:271-283; Donnelly et al (1997) Annu 
Rev Immunol 15:617-648). 
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Gene Delivery Vehicles 

Gene therapy vehicles for delivery of constructs, including a coding sequence of a 
therapeutic of the invention, to be delivered to the mammal for expression in the mammal, can 
be administered either locally or systemically. These constructs can utilize viral or non-viral 
vector approaches in in vivo or ex vivo modality. Expression of such coding sequence can be 
induced using endogenous mammaUan or heterologous promoters. Expression of the coding 
sequence in vivo can be either constitutive or regulated. 

The invention includes gene delivery vehicles capable of expressing the contemplated 
nucleic acid sequences. The gene delivery vehicle is preferably a viral vector and, more 
preferably, a retroviral, adenoviral, adeno-associated viral (AAV), herpes viral, or alphavirus 
vector. The viral vector can also be an astrovirus, coronavims, orthomyxovirus, papovavirus, 
paramyxovirus, parvovirus, picomavims, poxvirus, or togavirus viral vector. See generally, 
Jolly (1994) Cancer Gene Therapy 1:51-64; Kimura (1994) Human Gene Therapy 5:845-852; 
Connelly (1995) Human Gene Therapy 6:185-193; and Kaplitt (1994) Nature Genetics 
6:148-153. 

Retroviral vectors are well known in the art, including B, C and D type retroviruses, 
xenotropic retroviruses (for example, NZB-Xl, NZB-X2 and NZB9-1 (see O'Neill (1985) J. 
ViroL 53:160) polytropic retroviruses e.g., MCF and MCF-MLV (see Kelly (1983) J. Virol. 
45:291), spumaviruses and lentiviruses. See RNA Tumor Viruses, Second Edition, Cold 
Spring Harbor Laboratory, 1985. 

Portions of the retroviral gene therapy vector may be derived from different 
retroviruses. For example, retrovector LTRs may be derived from a Murine Sarcoma Vims, a 
tRNA binding site from a Rous Sarcoma Virus, a packaging signal from a Murine Leukemia 
Virus, and an origin of second strand synthesis from an Avian Leukosis Virus. 

These recombinant retroviral vectors may be used to generate transduction competent 
retroviral vector particles by introducing them into appropriate packaging cell lines (see US 
patent 5,591,624). Retrovirus vectors can be constructed for site-specific integration into host 
cell DNA by incorporation of a chimeric integrase enzyme into the retroviral particle (see 
W096/37626). It is preferable that the recombinant viral vector is a replication defective 
recombinant virus. 
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Packaging cell lines suitable for use with the above-described retrovirus vectors are 
well known in the art, are readily prepared (see WO95/30763 and WO92/05266), and can be 
used to create producer cell lines (also termed vector cell lines or "VCLs") for the production 
of recombinant vector particles. Preferably, the packaging cell lines are made from human 
parent cells (e.g., HT1080 cells) or mink parent cell Unes, which eliminates inactivation in 
human serum. 

Preferred retroviruses for the construction of retroviral gene therapy vectors include 
Avian Leukosis Virus, Bovine Leukemia, Virus, Murine Leukemia Virus, Mink-Cell 
Focus-Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis Virus and Rous 
Sarcoma Vims. Particularly preferred Murine Leukemia Viruses include 4070A and 1504A 
(Hartley and Rowe (1976) J Virol 19:19-25), Abelson (ATCC No. VR-999), Friend (ATCC 
No. VR-245), Graflfi, Gross (ATCC Nol VR-590), Kirsten, Harvey Sarcoma Virus and 
Rauscher (ATCC No. VR-998) and Moloney Murine Leukemia Virus (ATCC No. VR-1 90). 
Such retroviruses may be obtained from depositories or collections such as the American 
Type Culture Collection ("ATCC") m Rockville, Maryland or isolated from known sources 
using commonly available techniques. 

Exemplary knovra retroviral gene therapy vectors employable in this invention include 
those described in patent applications GB2200651, EP0415731, EP0345242, EP0334301, 
WO89/02468; WO89/05349, WO89/09271, WO90/02806, WO90/07936, WO94/03622, 
W093/25698, W093/25234, WO93/11230, WO93/10218, WO91/02805, WO91/02825, 
WO95/07994, US 5,219,740, US 4,405,712, US 4,861,719, US 4,980,289, US 4,777,127, US 
5,591,624. See also Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 
53:962-967; Ram (1993) Cancer Res 53 (1993) 83-88; Takamiya {1992) J Neurosci Res 
33:493-503; Baba (1993) JNeurosurg 79:729-735; Mann (1983) Cell 33:153; Cane (1984) 
Proc Natl Acad Sci 81:6349; and Miller (1990) Human Gene Therapy 1. 

Human adenoviral gene therapy vectors are also known in the art and employable in 
this invention. See, for example, Berkner (1988) Biotechniques 6:616 and Rosenfeld (1991) 
Science 252:431, and WO93/07283, WO93/06223, and WO93/07282. Exemplary known 
adenoviral gene therapy vectors employable in this invention include those described in the 
above referenced documents and in W094/12649, WO93/03769, W093/19191, 
W094/28938, W095/11984. WO95/00655, WO95/27071, W095/29993, W095/34671, 
WO96/05320, WO94/08026, WO94/11506, WO93/06223, W094/24299, WO95/14102, 



wo 99/57280 




PCT/US99/09346 



39 



W095/24297, WO95/02697, W094/28152, W094/24299, WO95/09241, WO95/25807, 
WO95/05835, W094/18922 and WO95/09654. Alternatively, administration of DNA linked 
to killed adenovirus as described in Curiel (1992) Hum, Gene Ther 3:147-154 may be 
employed. The gene delivery vehicles of the invention also include adenovirus associated 
virus (AAV) vectors. Leading and preferred examples of such vectors for use in this invention 
are the AAV-2 based vectors disclosed in Srivastava, WO93/09239. Most preferred AAV 
vectors comprise the two AAV inverted terminal repeats in vi^hich the native D-sequences are 
modified by substitution of nucleotides, such that at least 5 native nucleotides and up to 1 8 
native nucleotides, preferably at least 10 native nucleotides up to 18 native nucleotides, most 
preferably 10 native nucleotides are retained and the remaining nucleotides of the D-sequence 
are deleted or replaced with non-native nucleotides. The native D-sequences of the AAV 
inverted terminal repeats are sequences of 20 consecutive nucleotides in each AAV inverted 
terminal repeat (i.e., there is one sequence at each end) which are not involved in HP 
formation. The non-native replacement nucleotide may be any nucleotide other than the 
nucleotide found in the native D-sequence in the same position. Other employable exemplary 
AAV vectors are pWP-19, pWN-1, both of which are disclosed in Nahreini (1993) Gene 
124:257-262. Another example of such an AAV vector is psub201 (see Samulski (1987) J. 
Virol 61 :3096). Another exemplary AAV vector is the Double-D ITR vector. Construction of 
the Double-D ITR vector is disclosed in US Patent 5,478,745. Still other vectors are those 
disclosed in Carter US Patent 4,797,368 and Muzyczka US Patent 5,139,941, Chartejee US 
Patent 5,474,935, and Kotin W094/288157. Yet a further example of an AAV vector 
employable in this invention is SSV9AFABTKneo, which contains the AFP enhancer and 
albimiin promoter and directs expression predominantly in the liver. Its structure and 
construction are disclosed in Su (1996) Human Gene Therapy 7:463-470. Additional AAV 
gene therapy vectors are described in US 5,354,678, US 5,173,414, US 5,139,941, and US 
5,252,479. 

The gene therapy vectors comprising sequences of the invention also include herpes 
vectors. Leading and preferred examples are herpes simplex virus vectors containing a 
sequence encoding a thymidine kinase polypeptide such as those disclosed in US 5,288,641 
and EP0176170 (Roizman). Additional exemplary herpes simplex virus vectors include 
HFEM/ICP6-LacZ disclosed in WO95/04139 (Wistar Institute), pHSVlac described in Geller 
(1988) Science 241 :1667-1669 and in WO90/09441 and WO92/07945, HSV Us3::pgC-lacZ 
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described in Fink (1992) Human Gene Therapy 3:1 1-19 and HSV 7134, 2 RH 105 and GAL4 
described in EP 0453242 (Breakefield), and those deposited with the ATCC as accession 
numbers ATCC VR-977 and ATCC VR-260. 

Also contemplated are alpha virus gene therapy vectors that can be employed in this 
invention. Preferred alpha virus vectors are Sindbis viruses vectors. Togaviruses, Semliki 
Forest virus (ATCC VR-67; ATCC VR.1247), Middleberg virus (ATCC VR.370), Ross 
River virus (ATCC VR-373; ATCC VR-1246), Venezuelan equine encephaUtis virus (ATCC 
VR923; ATCC VR-1250; ATCC VR-1249; ATCC VR-532), and those described in US 
patents 5,091,309, 5,217,879, and WO92/10578. More particularly, those alpha virus vectors 
described in U.S. Serial No. 08/405,627, filed March 15, 1995,W094/21792, WO92/10578, 
WO95/07994, US 5,091 ,309 and US 5,217,879 are employable. Such alpha viruses may be 
obtained from depositories or collections such as the ATCC in Rockville, Maryland or 
isolated from known sources using commonly available techniques. Preferably, alphavirus 
vectors with reduced cytotoxicity are used (see USSN 08/679640). 

DNA vector systems such as eukarytic layered expression systems are also useful for 
expressing the nucleic acids of the invention. SeeWO95/07994 for a detailed description of 
eukaryotic layered expression systems. Preferably, the eukaryotic layered expression systems 
of the invention are derived from alphavirus vectors and most preferably from Sindbis viral 
vectors. 

Other viral vectors suitable for use in the present invention include those derived from 
poliovirus, for example ATCC VR-58 and those described in Evans, Nature 339 (1989) 385 
and Sabin (1973) 1 BioL Standardization 1:115; rhino virus, for example ATCC VR-1 1 10 and 
those described in Arnold (1990) J Cell Biochem L401; pox viruses such as canary pox virus 
or vaccinia virus, for example ATCC VR-l 1 1 and ATCC VR-2010 and those described in 
Fisher-Hoch (1989) Proc Natl Acad Sci 86:317; Flexner (1989) Ann NY Acad Sci 569:86, 
Flexner (1990) Vaccine 8:17; in US 4,603,112 and US 4,769,330 and WO89/01973; SV40 
virus, for example ATCC VR-305 and those described in Mulligan (1979) Nature 277:108 
and Madzak (1992) J Gen Virol 73: 1533; influenza virus, for example ATCC VR-797 and 
recombinant influenza viruses made employing reverse genetics techniques as described in 
US 5,166,057 and in Enami {1990) Proc Natl Acad Sci 87:3802-3805; Enami & Palese (1991) 
J Virol 65:271 1-2713 and Luytjes (1989) Cell 59:1 10, (see also McMichael (1983) NEJMed 
309:13, and Yap (1978) Nature 273:238 and Nature (1979) 277:108); human 
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immunodeficiency virus as described in EP-0386882 and in Buchschacher (1992) J. ViroL 
66:2731; measles virus, for example ATCC VR-67 and VR-1247 and those described in EP- 
0440219; Aura virus, for example ATCC VR-368; Bebaru virus, for example ATCC VR-600 
and ATCC VR-1240; Cabassou virus, for example ATCC VR-922; Chikungunya virus, for 
example ATCC VR-64 and ATCC VR-1241; Fort Morgan Virus, for example ATCC 
VR-924; Getah virus, for example ATCC VR-369 and ATCC VR-1243; Kyzylagach virus, 
for example ATCC VR-927; Mayaro virus, for example ATCC VR-66; Mucambo virus, for 
example ATCC VR-580 and ATCC VR-1244; Ndumu virus, for example ATCC VR-371; 
Pixuna virus, for example ATCC VR-372 and ATCC VR-1245; Tonate virus, for example 
ATCC VR-925; Triniti virus, for example ATCC VR-469; Una virus, for example ATCC 
VR-374; Whataroa virus, for example ATCC VR-926; Y-62-33 virus, for example ATCC 
VR-375; 0*Nyong virus, Eastem encephalitis virus, for example ATCC VR-65 and ATCC 
VR-1242; Western encephalitis virus, for example ATCC VR-70, ATCC VR-1251, ATCC 
VR-622 and ATCC VR-1252; and coronavirus, for example ATCC VR-740 and those 
described in Hamre (1966) Proc Soc Exp Biol Med 121:190. 

Delivery of the compositions of this invention into cells is not limited to the above 
mentioned viral vectors. Other delivery methods and media may be employed such as, for 
example, nucleic acid expression vectors, polycationic condensed DNA linked or unlinked to 
killed adenovirus alone, for example see US Serial No. 08/366,787, filed December 30, 1994 
and Curiel (1992) Hum Gene Ther 3:147-154 ligand linked DNA, for example see Wu (1989) 
J Biol Chem 264:16985-16987, eucaryotic cell delivery vehicles cells, for example see US 
Serial No.08/240,030, filed May 9, 1994, and US Serial No. 08/404,796, deposition of 
photopolymerized hydrogel materials, hand-held gene transfer particle gun, as described in 
US Patent 5,149,655, ionizing radiation as described in US5,206,152 and in W092/1 1033, 
nucleic charge neutralization or fiision with cell membranes. Additional approaches are 
described in Philip (1994) Mol Cell Biol 14:241 1-2418 and in Woffendin (1994) Proc Natl 
^caJ 5a 91:1581-1585. 

Particle mediated gene transfer may be employed, for example see US Serial No. 
60/023,867. Briefly, the sequence can be inserted into conventional vectors that contain 
conventional control sequences for high level expression, and then incubated with synthetic 
gene transfer molecules such as polymeric DNA-binding cations like polylysine, protamine, 
and albumin, linked to cell targeting ligands such as asialoorosomucoid, as described in Wu & 
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Wu (1987) J. BioL Chem. 262:4429-4432. insulin as described in Hucked (1990) Biochem 
Pharmacol 40:253-263, galactose as described in Plank (1992) Bioconjugate Chem 
3:533-539, lactose or transferrin. 

Naked DNA may also be employed to transform a host cell. Exemplary naked DNA 
introduction methods are described in WO 90/1 1092 and US 5,580,859. Uptake efficiency 
may be improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method may be improved 
further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. 

Liposomes that can act as gene delivery vehicles are described in U.S. 5,422,120, 
W095/13796, W094/23697, W091/14445 and EP-524,968. As described in USSN. 
60/023,867, on non-viral delivery, the nucleic acid sequences encoding a polypeptide can be 
inserted into conventional vectors that contain conventional control sequences for high level 
expression, and then be incubated with synthetic gene transfer molecules such as polymeric 
DNA-binding cations like polylysine, protamine, and albumin, linked to cell targeting ligands 
such as asialoorosomucoid, insulin, galactose, lactose, or transferrin. Other delivery systems 
include the use of liposomes to encapsulate DNA comprising the gene under the control of a 
variety of tissue-specific or ubiquitously-active promoters. Further non- viral delivery suitable 
for use includes mechanical delivery systems such as the approach described in Woffendin et 
al (1994) Proc. Natl, Acad. Set USA 91(24):1 1581-1 1585. Moreover, the coding sequence 
and the product of expression of such can be delivered through deposition of 
photopolymerized hydrogel materials. Other conventional methods for gene delivery that can 
be used for delivery of the coding sequence include, for example, use of hand-held gene 
transfer particle gun, as described in U.S. 5,149,655; use of ionizing radiation for activating 
transferred gene, as described in U.S. 5,206,152 and W092/1 1033. 

Exemplary liposome and polycationic gene delivery vehicles are those described in 
US 5,422,120 and 4,762,915; inWO 95/13796; W094/23697; and W091/14445; in EP- 
0524968; and in Stiyer, Biochemistry, pages 236-240 (1975) W.H. Freeman, San Francisco; 
Szoka (1980) Biochem Biophys Acta 600:1 ; Bayer (1979) Biochem Biophys Acta 550:464; 
Rivnay (1987) Meth Enzymol 149:1 19; Wang (1987) Proc Natl Acad Sci 84:7851; Plant 
{\9%9) Anal Biochem 176:420. 
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A polynucleotide composition can comprises therapeutically effective amount of a 
gene therapy vehicle, as the term is defined above. For purposes of the present invention, an 
effective dose will be &om about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg 
of the DNA constructs in the individual to which it is administered. 

Delivery Methods 

Once formulated, the polynucleotide compositions of the invention can be 
administered (1) directly to the subject; (2) delivered ex vivo, to cells derived from the 
subject; or (3) in vitro for expression of recombinant proteins. The subjects to be treated can 
be mammals or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or lesion. 
Other modes of administration include oral and pubnonary administration, suppositories, and 
transdermal applications, needles, and gene guns or hyposprays. Dosage treatment may be a 
single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject 
are known in the art and described in eg, W093/14778. Examples of cells useful in ex vivo 
applications include, for example, stem cells, particularly hematopoetic, lymph cells, 
macrophages, dendritic cells, or timior cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by the following procedures, for example, dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection 
of the DNA into nuclei, all well known in the art. 

Polynucleotide and polypeptide pharmaceutical compositions 

In addition to the pharmaceutically acceptable carriers and salts described above, the 
following additional agents can be used with polynucleotide and/or polypeptide compositions. 
A.Polypeptides 

One example are polypeptides which include, without limitation: asioloorosomucoid 
(ASOR); transferrin; asialoglycoproteins; antibodies; antibody fragments; ferritin; 
interieukins; interferons, granulocyte, macrophage colony stimulating factor (GM-CSF), 
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granulocyte colony stimulating factor (G-CSF), macrophage colony stimulating factor 
(M-CSF), stem cell factor and erythropoietin. Viral antigens, such as envelope proteins, can 
also be used. Also, proteins from other invasive organisms, such as the 17 amino acid peptide 
from the circumsporozoite protein of Plasmodium falciparum known as RII. 

B. Hormones, Vitamins, Etc. 

Other groups that can be included are, for example: hormones, steroids, androgens, 
estrogens, thyroid hormone, or vitamins, folic acid. 

C. Polyalkylenes, Polysaccharides, etc. 

Also, polyalkylene glycol can be included with the desired polynucleotides or 
polypeptides. In a preferred embodiment, the polyalkylene glycol is polyethlylene glycol. In 
addition, mono-, di-, or polysaccarides can be included. In a preferred embodunent of this 
aspect, the polysaccharide is dextran or DEAE-dextran. Also, chitosan and 
poly(lactide-co-glycolide) 

D. Lipids, and Liposomes 

The desired polynucleotide or polypeptide can also be encapsulated in lipids or 
packaged in liposomes prior to delivery to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are able to 
stably bind or entrap and retain nucleic acid. The ratio of condensed polynucleotide or 
polypeptide to lipid preparation can vary but will generally be around 1:1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers for 
delivery of nucleic acids, see, Hug and Sleight (1991) Biochim. Biophys. Acta, 1097:1-17; 
Straubinger(1983)Me/A, Enzymol 101:512-527. 

Liposomal preparations for use in the present invention include cationic (positively 
charged), anionic (negatively charged) and neutral preparations, Cationic liposomes have 
been shown to mediate intracellular delivery of plasmid DNA (Feigner (1987) Proc. Natl 
Acad, Sci. USA 84:7413-7416); mRNA (Malone (1989) Proc. Natl, Acad, ScL USA 
86:6077-6081); and purified transcription factors (Debs (1990) /. Biol. Chem. 
265:10189-10192), in functional form. 

Cationic liposomes are readily available. For example, 
N[l-2,3-dioIeyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 
under the trademark Lipofectin, from GBCO BRL, Grand Island, NY. (See, also, Feigner 
supra). Other commercially available liposomes include transfectace (DDAB/DOPE) and 
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DOTAP/DOPE (Boerhinger). Other cationic liposomes can be prepared from readily available 
materials using techniques well known in the art. See, eg, Szoka (1978) Proc. Natl. Acad. Sci. 
USA 75:4194-4198; WO90/1 1092 for a description of the synthesis of DOTAP 
(1 ,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. 

Similarly, anionic and neutral Uposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, AL), or can be easily prepared using readily available materials. 
Such materials include phosphatidyl choline, cholesterol, phosphatidyl ethanolamine, 
dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), 
dioleoylphoshatidyl ethanolamine (DOPE), among others. These materials can also be mixed 
with the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

The liposomes can comprise multilammelar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs). The various liposome-nucleic acid 
complexes are prepared using methods known in the art. See eg, Straubinger (1983) Meth, 
Immunol 101:512-527; Szoka (1978) Proc. Natl Acad. Set USA 75:4194-4198; 
Papahadjopoulos (1975) Biochim. Biophys. Acta 394:483; Wilson (1979) Cell Deamer 
& Bangham (1976) Biochim. Biophys. Acta 443:629; Ostro (1977) Biochem. Biophys. Res. 
Commun. 76:836; Fraley (1979) Proc. Natl Acad. Sci. USA 76:3348); Enoch & Strittmatter 
(1979) Proc. Natl Acad ScL USA 76:145; Fraley (1980) J. Biol Chem, (1980) 255:10431; 
Szoka & Papahadjopoulos (1978) Proc. Natl Acad. Sci. USA 75:145; and Schaefer-Ridder 
{1982) Science 215:166. 
E.Lipoproteins 

In addition, lipoproteins can be included with the polynucleotide or polypeptide to be 
delivered. Examples of lipoproteins to be utilized include: chylomicrons, HDL, IDL, LDL, 
and VLDL. Mutants, fragments, or fusions of these proteins can also be used. Also, 
modifications of naturally occurring lipoproteins can be used, such as acetylated LDL. These 
lipoproteins can target the delivery of polynucleotides to cells expressing lipoprotein 
receptors. Preferably, if lipoproteins are including with the polynucleotide to be delivered, no 
other targeting ligand is included in the composition. 

Naturally occurring lipoproteins comprise a lipid and a protein portion. The protein 
portion are known as apoproteins. At the present, apoproteins A, B, C, D, and E have been 
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isolated and identified. At least two of these contain several proteins, designated by Roman 
numerals, AI, AH, AIV; CI, CII, CEL 

A lipoprotein can comprise more than one apoprotein. For example, naturally 
occurring chylomicrons comprises of A, B, C, and E, over time these lipoproteins lose A and 
acquire C and E apoproteins. VLDL comprises A, B, C, and E apoproteins, LDL comprises 
apoprotein B; and HDL comprises apoproteins A, C, and E. 

The amino acid of these apoprotems are known and are described in, for example, 
Breslow (1985) Annu Rev. Biochem 54:699; Law (1986) Adv, Exp Med. Biol 151:162; Chen 
(1986)75/0/ Chem 261:12918; Kane {mO) Proc Natl Acad Sci USA 77:2465; and Utermann 
(\9U) Hum Genet 65:232. 

Lipoproteins contain a variety of lipids including, triglycerides, cholesterol (free and 
esters), and phopholipids. The composition of the lipids varies in naturally occurring 
lipoproteins. For example, chylomicrons comprise mamly triglycerides. A more detailed 
description of the lipid content of naturally occurring lipoproteins can be found, for example, 
mMeth, Enzymol 128 (1986). The composition of the lipids are chosen to aid in 
conformation of the apoprotein for receptor binding activity. The composition of lipids can 
also be chosen to facilitate hydrophobic interaction and association with the polynucleotide 
binding molecule. 

Naturally occurring lipoproteins can be isolated from serum by ultracentrifiigation, for 
instance. Such methods are described in Meth. Enzymol (supra); Pitas (1980) J. Biochem. 
255:5454-5460 and Mahey (1979) J Clin. Invest 64:743-750. 

Lipoproteins can also be produced by in vitro or recombinant methods by expression 
of the apoprotein genes in a desired host cell. See, for example, Atkinson (1986) Annu Rev 
Biophys Chem 1 5:403 and Radding (1958) Biochim Biophys Acta 30: 443. 

Lipoproteins can also be purchased from commercial supphers, such as Biomedical 
Techniologies, Inc., Stoughton, Massachusetts, USA. 

Further description of lipoproteins can be found in Zuckermann et al., PCT, Appln. 
NO.US97/14465. 
F.Polycationic Agents 

Polycationic agents can be included, with or without lipoprotein, in a composition 
with the desired polynucleotide or polypeptide to be delivered. 
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Polycationic agents, typically, exhibit a net positive charge at physiological relevant 
pH and are capable of neutralizing the electrical charge of nucleic acids to facilitate delivery 
to a desired location. These agents have both in vitro, ex vivo, and in vivo applications. 
Polycationic agents can be used to deliver nucleic acids to a living subject either 
intramuscularly, subcutaneously, etc. 

The following are examples of useful polypeptides as polycationic agents: polylysine, 
polyarginine, polyomithine, and protamine. Other examples include histones, protamines, 
human serum albumin, DNA binding proteins, non-histone chromosomal proteins, coat 
proteins from DNA viruses, such as (XI 74, transcriptional factors also contain domains that 
bind DNA and therefore may be useful as nucleic aid condensing agents. Briefly, 
transcriptional factors such as C/CEBP, c-jun, c-fos, AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1, 
Oct-1, Oct-2, CREP, and TFIID contain basic domains that bind DNA sequences. 

Organic polycationic agents include: spermine, spermidine, and purtrescine. 

The dimensions and of the physical properties of a polycationic agent can be 
extrapolated from the list above, to construct other polypeptide polycationic agents or to 
produce synthetic polycationic agents. 

Synthetic Polycationic Agents 

Synthetic polycationic agents which are useful include, for example, DEAE-dextran, 
polybrene. LipofectinD, and lipofectAMINED are monomers that form polycationic 
complexes when combined with polynucleotides or polypeptides. 

Immunodiagnostic Assays 

Neisserial antigens of the invention can be used in immunoassays to detect antibody 
levels (or, conversely, anti-Neisserial antibodies can be used to detect antigen levels). 
Immunoassays based on well defined, recombinant antigens can be developed to replace 
invasive diagnostics methods. Antibodies to Neisserial proteins within biological samples, 
including for example, blood or serum samples, can be detected. Design of the immunoassays 
is subject to a great deal of variation, and a variety of these are known in the art. Protocols for 
the immunoassay may be based, for example, upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, use solid supports, or may be by 
immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the 
labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. 
Assays which amplify the signals from the probe are also known; examples of which are 
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assays which utilize biotin and avidin, and enzyme-labeled and mediated inmnmoassays, such 
as ELISA assays. 

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are 
constructed by packaging the ^propriate materials, including the compositions of the 
invention, in suitable containers, along with the remaining reagents and materials (for 
example, suitable buffers, salt solutions, etc) required for the conduct of the assay, as well as 
suitable set of assay instructions. 

Nucleic Acid Hybridisation 

"Hybridization" refers to the association of two nucleic acid sequences to one another 
by hydrogen bonding. Typically, one sequence will be fixed to a solid support and the other 
will be free in solution. Then, the two sequences will be placed in contact with one another 
under conditions that favor hydrogen bonding. Factors that affect this bonding include: the 
type and volume of solvent; reaction temperature; time of hybridization; agitation; agents to 
block the non-specific attachment of the liquid phase sequence to the solid support 
(Denhardt's reagent or BLOTTO); concentration of the sequences; use of compounds to 
increase the rate of association of sequences (dextran sulfate or polyethylene glycol); and the 
stringency of the washing conditions following hybridization. See Sambrook et al [supra] 
Volume 2, chapter 9, pages 9.47 to 9.57. 

"Stringency" refers to conditions in a hybridization reaction that favor association of 
very similar sequences over sequences that differ. For example, the combination of 
temperature and saU concentration should be chosen that is approximately 120 to 200DC 
below the calculated Tm of the hybrid under study. The temperature and salt conditions can 
often be determined empirically in preliminary experiments in which samples of genomic 
DNA immobilized on filters are hybridized to the sequence of interest and then washed under 
conditions of different stringencies. See Sambrook et al at page 9.50. 

Variables to consider when performing, for example, a Southem blot are (1) the 
complexity of the DNA being blotted and (2) the homology between the probe and the 
sequences being detected. The total amount of the fragment(s) to be studied can vary a 
magnitude of 1 0, from 0. 1 to 1 jig for a plasmid or phage digest to 1 0"^ to 1 0*^ g for a single 
copy gene in a highly complex eukaryotic genome. For lower complexity polynucleotides, 
substantially shorter blotting, hybridization, and exposure times, a smaller amount of starting 
polynucleotides, and lower specific activity of probes can be used. For example, a single-copy 
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yeast gene can be detected with an exposure time of only 1 hour starting with 1 jig of yeast 
DNA, blotting for two hours, and hybridizing for 4-8 hours with a probe of 10^ cpm/^g. For a 
single-copy mammalian gene a conservative q)proach would start v/ith 10 \ig of DNA, blot 
overnight, and hybridize overnight in the presence of 10% dextran sulfate using a probe of 
greater than 10® cpm/^ig, resulting in an exposure time of -24 hours. 

Several factors can affect the melting temperature (Tm) of a DNA-DNA hybrid 
between the probe and the fragment of mterest, and consequently, the appropriate conditions 
for hybridization and washing. In many cases the probe is not 100% homologous to the 
fragment. Other commonly encountered variables include the length and total G+C content of 
the hybridizing sequences and the ionic strength and fomiamide content of the hybridization 
buffer. The effects of all of these factors can be approximated by a single equation: 

Tm= 81 + 16.6(logioCi) + 0.4[%(G + C)]-0.6(%formamide) - 600//i-L5(%mismatch). 

where Ci is the salt concentration (monovalent ions) and n is the length of the hybrid 
in base pairs (slightly modified from Meinkoth & Wahl (1984) AnaL Biochem. 138: 267-284). 

In designing a hybridization experiment, some factors affecting nucleic acid 
hybridization can be conveniently altered. The temperature of the hybridization and washes 
and the salt concentration during the washes are the simplest to adjust. As the temperature of 
the hybridization increases (ie. stringency), it becomes less likely for hybridization to occur 
between strands that are nonhomologous, and as aresuh, background decreases. If the 
radiolabeled probe is not completely homologous with the immobilized fragment (as is 
frequently the case in gene family and interspecies hybridization experiments), the 
hybridization temperature must be reduced, and background will increase. The temperature of 
the washes affects the intensity of the hybridizing band and the degree of background in a 
similar manner. The stringency of the washes is also increased with decreasing salt 
concentrations. 

In general, convenient hybridization temperatures in the presence of 50% formamide 
are 42DC for a probe with is 95% to 100% homologous to the target fragment, 37DC for 90% 
to 95% homology, and 3200 for 85% to 90% homology. For lower homologies, formamide 
content should be lowered and temperature adjusted accordingly, using the equation above. If 
the homology between the probe and the target fragment are not known, the simplest 
approach is to start v/ith both hybridization and wash conditions which are nonstringent. If 
non-specific bands or high background are observed after autoradiography, the filter can be 
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washed at high stringency and reexposed. If the time required for exposure makes this 
approach impractical, several hybridization and/or washing stringencies should be tested in 
parallel. 

Nucleic Acid Probe Assays 

Methods such as PGR, branched DNA probe assays, or blotting techniques utilizing 
nucleic acid probes according to the invention can determine the presence of cDNA or 
mRNA. A probe is said to "hybridize" with a sequence of the invention if it can form a duplex 
or double stranded complex, which is stable enough to be detected. 

The nucleic acid probes will hybridize to the Neisserial nucleotide sequences of the 
invention (including both sense and antisense strands). Though many different nucleotide 
sequences will encode the amino acid sequence, die native Neisserial sequence is preferred 
because it is the actual sequence present in cells. mRNA represents a coding sequence and so 
a probe should be complementary to the coding sequence; single-stranded cDNA is 
complementary to mRNA, and so a cDNA probe should be complementary to the non-coding 
sequence. 

The probe sequence need not be identical to the Neisserial sequence (or its 
complement) — some variation in the sequence and length can lead to increased assay 
sensitivity if the nucleic acid probe can form a duplex with target nucleotides, which can be 
detected. Also, the nucleic acid probe can include additional nucleotides to stabilize the 
formed duplex. Additional Neisserial sequence may also be helpful as a label to detect the 
formed duplex. For example, a non-complementary nucleotide sequence may be attached to 
the 5' end of the probe, with the remainder of the probe sequence being complementary to a 
Neisserial sequence. Alternatively, non-complementary bases or longer sequences can be 
interspersed into the probe, provided that the probe sequence has sufficient complementarity 
with the a Neisserial sequence in order to hybridize therewith and thereby form a duplex 
which can be detected. 

The exact length and sequence of the probe will depend on the hybridization 
conditions, such as temperature, salt condition and the like. For example, for diagnostic 
applications, depending on the complexity of the analyte sequence, the nucleic acid probe 
typically contains at least 10-20 nucleotides, preferably 15-25, and more preferably at least 30 
nucleotides, although it may be shorter than this. Short primers generally require cooler 
temperatures to form sufficiently stable hybrid complexes with the template. 
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Probes may be produced by synthetic procedures, such as the triester method of 
Matteucci et al [1 Am. Chem. Soc. (1981) 103:3185], or according to Urdea et al [Proc. 
Natl Acad. ScL USA (1983) 80: 7461], or using conmiercially available automated 
oligonucleotide synthesizers. 

The chemical nature of the probe can be selected according to preference. For certain 
applications, DNA or RNA are appropriate. For other applications, modifications may be 
incorporated eg. backbone modifications, such as phosphorothioates or methylphosphonates, 
can be used to increase in vivo half-life, alter RNA affinity, increase nuclease resistance etc. 
[eg. see Agrawal & Iyer (1995) Curr Opin Biotechnol 6:12-19; Agrawal (1996) TIBTECH 
14:376-387]; analogues such as peptide nucleic acids may also be used [eg. see Corey (1997) 
TIBTECH 15:224-229; Buchardt etal (1993) T/BZECi/ 11:384-386]. 

One example of a nucleotide hybridization assay is described by Urdea et al in 
international patent application WO92/02526 [see also US patent 5,124,246]. 

Alternatively, the polymerase chain reaction (PCR) is another well-known means for 
detecting small amounts of target nucleic acids. The assay is described in: Mullis et al [Meth. 
Enzymol (1987) 155: 335-350]; US patent 4,683,195; and US patent 4,683,202. Two 
'"primer" nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can comprise sequence that does not hybridize to the sequence of the 
amplification target (or its complement) to aid with duplex stability or, for example, to 
incorporate a convenient restriction site. Typically, such sequence will flank the desired 
Neisserial sequence. 

A thermostable polymerase creates copies of target nucleic acids from the primers 
using the original target nucleic acids as a template. After a threshold amount of target nucleic 
acids are generated by the polymerase, they can be detected by more traditional methods, such 
as Southern blots. When using the Southern blot method, the labelled probe will hybridize to 
the Neisserial sequence (or its complement). 

Also, mRNA or cDNA can be detected by traditional blotting techniques described in 
Sambrook et al [supra]. mRNA, or cDNA generated from mRNA using a polymerase 
enzyme, can be purified and separated using gel electrophoresis. The nucleic acids on the gel 
are then blotted onto a solid support, such as nitrocellulose. The solid support is exposed to a 
labelled probe and then washed to remove any xmhybridized probe. Next, the duplexes 
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containing the labeled probe are detected. Typically, the probe is labelled with a radioactive 
moiety. 

EXAMPLES 

The examples describe nucleic acid sequences which have been identified in 
A'; meningitidis, and N. gonorrhoeae along with their respective and putative translation 
products. Not all of the nucleic acid sequences are complete ie. they encode less than the full- 
length wild-type protein. 

The examples are generally in the following format: 

• a nucleotide sequence which has been identified in K meningitidis 

• the putative translation product of said N. meningitidis sequence 

• a computer analysis of said translation product based on database comparisons 

• a corresponding nucleotide sequence identified fi-om N. gonorrhoeae 

• the putative translation product of said A^. gonorrhoeae sequence 

• a comparision of the percentage of identity between the translation product of the 
K meningitidis sequence and the K gonorrhoeae sequence. 

• a corresponding nucleotide sequence identified ft"om strain A of K meningitidis 

• the putative translation product of said K meningitidis strain A sequence 

• a comparision of the percentage of identity between the translation product of the 
K meningitidis sequence and the N. gonorrhoeae sequence. 

• a description of the characteristics of the protein which indicates that it might be 
suitably antigenic or immunogenic. 

Sequence comparisons were performed at NCBI (http://www.ncbi.nlm.nih.gov) using 
the algorithms BLAST, BLAST2, BLASTn, BLASTp, tBLASTn, BLASTx, & tBLASTx [eg 
see also Altschul et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs. Nucleic Acids Research 25:2289-3402]. Searches were performed 
against the following databases: non-redundant GenBank+EMBL+DDBJ+PDB sequences 
and non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR sequences. 

Dots within nucleotide sequences represent nucleotides which have been arbitrarily 
introduced in order to maintain a reading frame. In the same way, double-underlined 
nucleotides were removed . Lower case letters represent ambiguities which arose during 
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alignment of independent sequencing reactions (some of the nucleotide sequences in the 
examples are derived ftom combining the results of two or more experiments). 

Nucleotide sequences were scanned in all six reading frames to predict the presence of 
hydrophobic domains using an algorithm based on the statistical studies of Esposti et al 
[Critical evaluation of the hydropathy of membrane proteins (1990) EurJBiochem 190:207- 
219]. These domains represent potential transmembrane regions or hydrophobic leader 
sequences. 

Open reading frames were predicted from fragmented nucleotide sequences using the 
program ORFFINDER (NCBI). 

Underlined amino acid sequences indicate possible transmembrane domains or leader 
sequences in the ORFs, as predicted by the PSORT algorithm (http://www.psort.nibb.ac.jp). 
Functional domains were also predicted using the MOTIFS program (GCG Wisconsin & 
PROSITE). 

For each of the following examples: based on the presence of a putative leader 
sequence and/or several putative transmembrane domains (single-underlined) in the 
gonococcal protein, it is predicted that the proteins from N. meningitidis and A^. gonorrhoeae, 
and their respective epitopes, could be useful antigens or immunogenic compositions for 
vaccines or diagnostics. 

The standard techniques and procedures which may be employed in order to perform 
the invention (eg. to utilize the disclosed sequences for vaccination or diagnostic purposes) 
were summarized above. This summary is not a limitation on the invention but, rather, gives 
examples that may be used, but are not required. 

In particular, the following methods were used to express, purify and biochemically 
characterize the proteins of the invention. 
Chromosomal DNA Preparation 

Kmeningitidis strain 2996 was grown to exponential phase in 100ml of GC medium, 
harvested by centrifiigation, and resuspended in 5ml buffer (20%(w/v) Sucrose, 50mM Tris- 
HCl, 50mM EDTA, pH8). After 10 minutes incubation on ice, the bacteria were lysed by 
adding 10ml of lysis solution (SOmMNaCl, 1% Na-Sarkosyl, SO^ig/ml Proteinase K), and the 
suspension incubated at 3TC for 2 hours. Two phenol extractions (equilibrated to pH 8) and 
one CHCb/isoamylalcohol (24:1) extraction were performed. DNA was precipitated by 
addition of 0.3M sodium acetate and 2 volumes of ethanol, and collected by centrifiigation. 
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The pellet was washed once with 70%(v/v) ethanol and redissolved in 4.0ml TE buffer 
(lOmM Tris-HCl, ImM EDTA, pH 8.0). The DNA concentration was measured by readmg 
the OD at 260 nm. 

Oligonucleotide design 

Synthetic oligonucleotide primers were designed on the basis of the coding sequence 
of each ORF, using (a) the meningococcus B sequence when available, or (b) the 
gonococcus/meningococcus A sequence, adapted to the codon preference usage of 
meningococcus as necessary. Any predicted signal peptides were omitted, by designing the 5' 
primers to sequence immediately downstream from the predicted leader sequence. 

For most ORFs, the 5' primers included two restriction enzyme recognition sites 
{BamHl-Ndel, BamHl-Nhel, EcoRI-Ndel ox EcoRl-Nhel\ depending on the restriction pattern 
of the gene of interest. The 3' primers included ?iXhol or aTfmcOII restriction site (table 1). 
This procedure was established in order to direct the cloning of each amplification product 
(corresponding to each ORF) into two different expression systems: pGEX-KG (using 
BamHI'Xhol, BamHl-HindSi, EcoRl-Xhol or EcoRl-HmdllT), and pET2 1 b+ (using Ndel- 
Xhol, Nhel-Xhol, NdeVHindlll or Nhel-Hindmy 



5 '-end primer tail: 



3 '-end primer tail: 



CGC GGATCCCATATG 
CGC GGATCCGCTAGC 
CCG GAATTC T ACATATG 
C CGGAATTC T AGC T AGC 



CCCG CTCGAG 
CCCGCTCGAG 



{BamHl-Ndel ) 
(BamHl-Nhel) 
(EcoRl-NdeT) 
{EcoRVNhel) 

{Xhol) 
{HindSS) 



For cloning ORFs into the pGEX-His vector, the 5' and 3' primers contained only one 
restriction enzyme site {EcoRl, Kpnl or SaR for the 5' primers and Pstl, Xbal, Sphl or SaR for 
the 3' primers). Again restriction sites were chosen according to the particular restriction 
pattern of the gene (table 1 ). 



5 '-end primer tail: 



3 '-end primer tail: 



(AAA)AA AGAATTC 
(AAA) AAAGGTACC 
(AAA) AA ACTGCAG 
(AAA) AAATCTAGA 



{EcoRl ) 
(Kpnl) 
(Pstl) 
(Xbal) 
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AAAGCATGC (SphI) 
5 ' or 3'-end primer tail: AAAAAAGTCGAC (Sail) 

As well as containing the restriction enzyme recognition sequences, the primers 
included nucleotides which hybridized to the sequence to be amplified. The melting 
temperature depended on the number and type of hybridising nucleotides in the whole primer, 
and was determined for each primer using the formulae: 



Tm = 4 (Gri-C)+ 2 (A+T) (tail excluded) 

Tn,= 64.9 + 0.41 (% GC) - 600/N (whole primer) 

The melting temperatures of the selected oligonucleotides were usually 65-70°C for 
the whole oligo and SO-SS'^C for the hybridising region alone. 

Table 1 shows the forward and reverse primers used for each amplification. In certain 
cases, the sequence of the primer does not exactly match the sequence of the predicted ORF. 
This is because when initial amplifications were performed, the complete 5' and/or 3' 
sequences for some meningococcal B ORFs were not known. However the corresponding 
sequences had been identified in Gonococcus or in Meningoccus A. Hence, when the 
Meningoccus B sequence was incomplete or uncertain. Gonococcal or Meningococcal A 
sequences were used as the basis for primer design. These sequences were altered to take 
account of codon preference. It can be appreciated that, once the complete sequence is 
identified, this approach will no longer be necessary. 

Oligonucleotides were synthesized using a Perkin Elmer 394 DNA/RNA Synthesizer, 
eluted fi-om the columns in 2.0ml NH4OH, and deprotected by 5 hours incubation at 56°C. 
The oUgos were precipitated by addition of 0.3M Na- Acetate and 2 volumes ethanol. The 
samples were centrifiiged and the pellets resuspended in either lOOfxl or 1.0ml of water. The 
OD260 was determined using a Perkin Elmer Lambda Bio spectophotometer and the 
concentration adjusted to 2-10pmol/|xl. 



Amplification 

The standard PGR protocol was as follows: 50"200ng of genomic DNA was used as a 
template in the presence of 20-40^M of each oligonucletide primer, 400-800^M dNTPs 
solution, Ix PGR buffer (including 1 .5mM MgCh). 2.5 units TaqIDNA polymerase (using 
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Perkin-Elmer AmpliTaQ, GIBCO Platinum, Pwo DNA polymerase, or Tahara Shuzo Taq 
polymerase). In some cases, PGR was optimsed by the addition of lO^il DMSO or 50^1 2M 
Betaine. 

After a hot start (adding the polymerase during a preliminary 3 minute incubation of 
the whole mix at 95°C), each sample underwent a two-step amplification. The first 5 cycles 
were performed using the hybridization temperature that excluded the restriction enzyme tail 
of the primer (see above). This was followed by 30 cycles using the hybridization temperature 
calculated for the whole length oligos. The cycles were completed with a 10 minute extension 
step at 72*^0. The standard cycles were as follows: 





Denaturation 


Hybridisation 


Elongation 


First 5 cycles 


30 seconds 
95°C 


30 seconds 
50-55**C 


30-60 seconds 
72^C 


Last 30 cycles 


30 seconds 
95°C 


30 seconds 
65-70°C 


30-60 seconds 
72°C 



Elongation times varied according to the length of the ORF to be amplified. 
Amplifications were performed using either a 9600 or a 2400 Perkin Elmer GeneAmp PGR 
System. To check the results, 1/10 of the amplification volume was loaded onto a 1-1.5% 
(w/v) agarose gel and the size of each amplified firagment compared with a DNA molecular 
weight marker. 

The amplified DNA was either loaded directly on a 1% agarose gel or first 
precipitated with ethanol and resuspended in a volume suitable to be loaded on a 1 .0% 
agarose gel. The DNA firagment corresponding to the band of correct size was purified using 
the Qiagen Gel Extraction Kit, following the manufacturer's protocol. DNA fi:agments were 
eluted in a volume of 30^1 or SOjil with either H20 or lOmM Tris, pH 8.5. 

Digestion of PGR fragments 

The purified DNA corresponding to the amplified Augment was doubly-digested with 
the appropriate restriction enzymes for; cloning into pET-21b+ and expressing the protein as a 
C-terminus His-tagged fiision, for cloning into pGEX-KG and expressing the protein as a N- 
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terminus GST-fusion, and for cloning into pGEX-His and expressing the protein as a 
N-terminus GST-His tagged fusion. 

Each purified DNA fragment was incubated at 37°C for 3 hours to overnight with 20 
units of appropriate restriction enzyme (New England Biolabs) in a volume of either 30 or 
40^.1 in the presence of suitable digestion buffer. Digested fragments were purified using the 
QIAquick PGR purification kit (following the manufacturer's instructions) and eluted in a 
volume of 30^il or 50^1 with either H20 or lOmM Tris, pH 8.5. The DNA concentration was 
determined by quantitative agarose gel electrophoresis (1.0% gel) in the presence of a titrated 
molecular weight marker. 



Digestion of the cloning vectors (pET22B, pGEX-KG, pTRC-His A, pET21b+, pGEX- 
KG, and pGEX-His) 

The vector pGEX-His is a modified pGEX-2T vector carrying a region encoding six 
histidine residues upstream of the thrombin cleavage site and containing the multiple cloning 
site of the vector pTRC99 (Pharmacia). 10 fig plasmid was double-digested with 50 units of 
each restriction enzyme in 200 |il reaction volume in the presence of appropriate buffer by 
overnight incubation at 3TC. After loading the whole digestion on a 1% agarose gel, the 
band corresponding to the digested vector was purified from the gel using the Qiagen 
QIAquick Gel Extraction Kit and the DNA was eluted in 50 ^il of 10 mM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD260 of the sample, and adjusted to 50 
lig/lil 1 III of plasmid was used for each cloning procedure. 

lO^ig of plasmid vector was doubly-digested with 50 units of each restriction enzyme 
in a volume of 200|al with the appropriate buffer overnight at 3TC, The digest was loaded 
onto a 1 .0% agarose gel and the band corresponding to the digested vector purified using the 
Qiagen QIAquick Gel Extraction Kit. DNA was eluted in 50^1 of lOmM Tris-HCI, pH 8.5. 
The DNA concentration was evaluated by measuring ODzeonm and the concentration adjusted 
to 50^g/^l. l^il of plasmid was used for each cloning procedure. 

Cloning 

For some ORFs, the fi-agments corresponding to each ORF, previously digested and 
purified, were ligated in both pET22b and pGEX-KG. In a final volume of 20 ^ll, a molar 
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ratio of 3:1 fragment/vector was ligated using 0.5 ill of NEB T4 DNA ligase (400 units/jxl), in 
the presence of the buffer supplied by the manufacturer. The reaction was incubated at room 
temperature for 3 hours. In some experiments, ligation was performed using the Boheringer 
"Rapid Ligation Kit", following the manufacturer's instructions. 

In order to introduce the recombinant plasmid in a suitable strain, 100 ^1 E. coli DH5 
competent cells were incubated with the ligase reaction solution for 40 minutes on ice, then at 
3TC for 3 minutes, then, after adding 800 |al LB broth, again at 37°C for 20 minutes. The 
cells were then centrifuged at maximum speed in an Eppendorf microfuge and resuspended in 
approximately 200 |il of the supernatant. The suspension was then plated on LB ampicilUn 
(100 mg/ml ). 

The screening of the recombinant clones was performed by growing 5 
randomly-chosen colonies overnight at 37 °C in either 2 ml (pGEX or pTC clones) or 5ml 
(pET clones) LB broth + 100 ^g/ml ampicillin. The cells were then pelletted and the DNA 
extracted using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's 
instructions, to a final volume of 30 ^1. 5 \i\ of each individual miniprep (approximately Ig ) 
were digested with either ^^/eIZA7^oI or BamHllXhol and the whole digestion loaded onto a 1- 
1.5% agarose gel (depending on the expected insert size), in parallel with the molecular 
weight marker (1Kb DNA Ladder, GIBCO). The screening of the positive clones was made 
on the base of the correct insert size. 

For other ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated into both pET21b+ and pGEX-KG. A molar ratio of of 3:1 
fi^gment/vector was used in a final volume of 20^1, that included 0,5^1 T4 DNA ligase (400 
units/^1, NEB) and ligation buffer suppUed by the manufacturer. The reaction was performed 
at room temperature for 3 hours. In some experiments, ligation was perfonned using the 
Boheringer "Rapid Ligation Kit" and the manufacturer's protocol. 

Recombinant plasmid was transformed into 100|al of competent ^. coli DH5 or 
HBlOl by incubating the ligase reaction solution and bacteria for 40 minutes on ice then at 
37°C for 3 minutes. This was followed by the addition of 800^1 LB broth and incubation at 
37°C for 20 minutes. The cells were centrifuged at maximum speed in an Eppendorf 
microfiige, resuspended in approximately 200^1 of the supernatant and plated onto LB 
ampicillin (lOOmg/ml ) agar. 
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Screening for recombinant clones was performed by growing 5 randomly selected 
colonies overnight at 37°C in either 2.0ml (pGEX-KG clones) or 5.0ml (pET clones) LB 
broth + lOOixg/ml ampicillin. Cells were pelleted and plasmid DNA extracted using the 
Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's instructions. 
Approximately l|ag of each individual miniprep was digested with the appropriate restriction 
enzymes and the digest loaded onto a 1-1.5% agarose gel (depending on the expected insert 
size), in parallel with the molecular weight marker (Ikb DNA Ladder, GIBCO). Positive 
clones were selected on the basis of the size of insert. 

ORFs were cloned into PGEX-His, by doubly-digesting the PGR product and ligating 
into similarly digested vector. After cloning, recombinant plasmids were transformed into the 
E,coli host W3 110. Individual clones were grown overnight at 37°C in LB broth with 50|ag/ml 
ampicillin. 

Certain ORFs may be cloned mto the pGEX-HIS vector using EcoRVPstl cloning 
sites, or EcoRl-Sall, or Sall-Pstl, After cloning, the recombinant plasmids may be introduced 
in the^coli host W3110. 

Expression 

Each ORE cloned into the expression vector may then be transformed into the strain 
suitable for expression of the recombinant protein product. 1 ^il of each construct was used to 
transform 30 |al of Kcoli BL21 (pGEX vector), Kcoli TOP 10 (pTRC vector) or E.coli BL21- 
DE3 (pET vector), as described above. In the case of the pGEX-His vector, the same Kcoli 
strain (W3110) was used for mitial cloning and expression. Single recombinant colonies were 
inoculated into 2ml LB+Amp (100 ^g/ml), incubated at 37°C overnight, then diluted 1:30 in 
20 ml of LB+Amp (100 ng/ml) in 100 ml flasks, making sure that the ODeoo ranged between 
0.1 and 0.15. The flasks were incubated at 30*^0 into gyratory water bath shakers until OD 
indicated exponential growth suitable for induction of expression (0.4-0.8 OD for pET and 
pTRC vectors; 0.8-1 OD for pGEX and pGEX-His vectors). For the pET, pTRC and pGEX- 
His vectors, the protein expression was induced by addiction of ImM IPTG, whereas in the 
case of pGEX system the final concentration of IPTG was 0.2 mM. After 3 hours incubation 
at 30°C, the final concentration of the sample was checked by OD. In order to check 
expression, 1ml of each sample was removed, centrifiiged in a microfiige, the pellet 
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resuspended in PBS, and analysed by 12% SDS-PAGE with Coomassie Blue staining. The 
whole sample was centrifiiged at 6000g and the pellet resuspended in PBS for fiirther use. 

GST-fusion proteins large-scale purification. 

For some ORPs, a single colony was grown overnight at 37°C on LB+Amp agar plate. 
The bacteria were inoculated into 20 ml of LB+Amp liquid colture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600 ml of fresh medium and allowed to 
grow at the optimal temperature (20-37°C) to ODsso 0.8-1. Protein expression was induced 
with 0.2mM IPTG followed by three hours incubation. The culture was centrifiiged at 8000 
rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 7.5 ml 
cold PBS. The cells were disrupted by sonication on ice for 30 sec at 40W using a Branson 
sonifier B-15, frozen and thawed two times and centrifiiged again. The supernatant was 
collected and mixed with ISO^il Glutatione-Sepharose 4B resin (Phannacia) (previously 
washed with PBS) and incubated at room temperature for 30 minutes. The sample was 
centrifiiged at 700g^ for 5 minutes at 4C. The resin was washed twice with 10 ml cold PBS for 
10 minutes, resuspended in 1ml cold PBS, and loaded on a disposable column. The resin was 
washed twice with 2ml cold PBS until the flow-through reached OD280 of 0.02-0.06. The 
GST-fusion protein was eluted by addition of 700^1 cold Glutathione elution buffer lOmM 
reduced glutathione, 50mM Tris-HCl) and fractions collected until the OD280 was 0. 1 . 21 ^1 of 
each fraction were loaded on a 12% SDS gel using either Biorad SDS-PAGE Molecular 
weight standard broad range (Ml) (200, 1 16.25, 97.4, 66.2, 45, 31, 21.5, 14.4, 6.5 kDa) or 
Amersham Rainbow Marker (M") (220, 66, 46, 30, 21 .5, 14.3 kDa) as standards. As the MW 
of GST is 26kDa, this value must be added to the MW of each GST-fiision protein. 

For other ORFs, for each clone to be purified as a GST-fiision, a single colony was 
streaked out and grown overnight at 37°C on a LB/Amp. (lOOng/ml) agar plate. An isolated 
colony from this plate was inoculated into 20ml of LB/Amp (100 \ig/ml) liquid medium and 
grown overnight at 37°C with shaking. The overnight culture was diluted 1 :30 into 600ml 
LB/Amp (lOO^g/ml) liquid medium and allowed to gro\y at the optimal temperature (20- 
37°C) until the ODssonm reached 0.6-0.8. Recombinant protein expression was induced by 
addition of IPTG (final concentration 0.2mM) and the culture incubated for a fiirther 3 hours. 
Bacteria were harvested by centrifiigation at 8000xg for 1 5 min at 4°C. 
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The bacterial pellet was resuspended in 7.5ral cold PBS. Cells were disrupted by 
sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 and centrifuged at 
13 OOOxg for 30 min at 4°C. The supernatant was collected and mixed with 150^1 Glutatione- 
Sepharose 4B resin (Pharmacia), previously equilibrated with PBS, and incubated at room 
temperature with gentle agitation for 30 min. The batch-wise preparation was centrifuged at 
700xg for 5 min at 4°C and the supernatant discarded. The resin was washed twice 
(batchwise) with 10ml cold PBS for 10 min, resuspended in 1ml cold PBS, and loaded onto a 
disposable column. The resin continued to be washed with cold PBS, until the OD280nm of the 
flow-through reached 0.02-0.01. The GST-fusion protein was eluted by addition of 700^1 cold 
glutathione elution buffer (lOmM reduced glutathione, 50mM Tris-HCl pH 8.0) and fi-actions 
collected, until the ODagonm of the eluate indicated all the recombinant protein was obtained. 
20|al aliquots of each elution fraction were analyzed by SDS-PAGE using a 12% gel. The 
molecular mass of the purified proteins was determined using either the Bio-Rad broad range 
molecular weight standard (Ml) (200, 116, 97.4, 66.2, 45.0, 31.0, 21.5, 14.4, 6.5 kDa) or the 
Amersham Rainbow Marker (M2) (220, 66.2, 46.0, 30.0, 21.5, 14.3 kDa). The molecular 
weights of GST-fiision proteins are a combination of the 26 kDa GST protein and its fusion 
partner. Protein concentrations were estimated using the Bradford assay. 

His-fusion soluble proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on a LB + Amp agar 
plate. The bacteria were inoculated into 20ml of LB+Amp liquid culture and incubated 
overnight in a water bath shaker. Bacteria were diluted 1 :30 into 600ml fresh medium and 
allowed to grow at the optimal temperature (20-37°C) to OD550 0.6-0.8. Protein expression 
was induced by addition of 1 mM IPTG and the culture further incubated for three hours. The 
culture was centrifuged at 8000 rpm at 4°C, the supernatant was discarded and the bacterial 
pellet was resuspended in 7.5ml cold lOmM imidazole buffer (300 mM NaCl, 50 mM 
phosphate buffer, 10 mM imidazole, pH 8). The cells were disrupted by sonication on ice for 
30 sec at 40W using a Branson sonifier B-15, frozen and thawed two times and centrifiiged 
again. The supernatant was collected and mixed with 150[il Ni^^-resin (Pharmacia) 
(previously washed with lOmM imidazole buffer) and incubated at room temperature with 
gentle agitation for 30 minutes. The sample was centrifuged at 700g for 5 minutes at 4°C. 
The resin was washed twice with 10 ml cold lOmM imidazole buffer for 10 minutes. 
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resuspended in 1ml cold IQmM imidazole buffer and loaded on a disposable column. The 
resin was washed at 4°C with 2ml cold lOmM imidazole buffer until the flow-through 
reached the O.D280 of 0.02-0.06. The resin was washed with 2ml cold 20niM imidazole 
buffer (300 mM NaCl, 50 mM phosphate buffer, 20 mM imidazole, pH 8) until the flow- 
through reached the O.D280 of 0.02-0.06. The His-fiision protein was eluted by addition of 
700|Lil cold 250mM imidazole buffer (300 mM NaCl, 50 mM phosphate buffer, 250 mM 
imidazole, pH 8) and fractions collected until the O.D280 was 0.1. 21|il of each fraction were 
loaded on a 12% SDS gel. 

His-fusion insoluble proteins large-scale purification. 

A single colony was grown overnight at 37 °C on a LB + Amp agar plate. The 
bacteria were inoculated into 20 ml of LB+Amp liquid culture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600ml fresh medium and let to grow at the 
optimal temperature (37°C) to O.D550 0.6-0.8. Protein expression was induced by addition 
of 1 mM IPTG and the culture further incubated for three hours. The culture was centrifuged 
at SOOOrpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 
7.5 ml buffer B (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8). The cells 
were disrupted by sonication on ice for 30 sec at 40W using a Branson sonifier B-15, frozen 
and thawed twice and centrifuged again. The supematant was stored at -20*^0, while the 
pellets were resuspended in 2 ml guanidine buffer (6M guanidine hydrochloride, lOOmM 
phosphate buffer, 10 mM Tris-HCl, pH 7.5) and treated in a homogenizer for 10 cycles. The 
product was centrifuged at 13000 rpm for 40 minutes. The supematant was mixed with 150)al 
Ni^^-resin (Pharmacia) (previously washed with buffer B) and incubated at room temperature 
with gentle agitation for 30 minutes. The sample was centrifuged at 700 g for 5 minutes at 
4°C. The resin was washed twice with 10 ml buffer B for 10 minutes, resuspended in 1ml 
buffer B, and loaded on a disposable column. The resin was washed at room temperature 
with 2ral buffer B until the flow-through reached the OD280 of 0.02-0.06. The resin was 
washed with 2ml buffer C (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 6.3) until 
the flow-through reached the O.D280 of 0.02-0.06. The His-fusion protein was eluted by 
addition of 700^il elution buffer (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 
4.5) and fractions collected until the OD280 was 0.1. 21^1 of each fraction were loaded on a 
12% SDS gel. 
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Purification of His-fusion proteins. 

For each clone to be purified as a His-fusion, a single colony was streaked out and 
grown overnight at 37°C on a LB/Amp (100 ^ig/ml) agar plate. An isolated colony from this 
plate was inoculated into 20ml of LB/Amp (100 ^g/ml) liquid medium and grown overnight 
at 37°C with shaking. The overnight culture was diluted 1 :30 into 600ml LB/Amp (100 
Hg/ml) liquid medium and allowed to grow at the optimal temperature (20-37°C) until the 
OD550nm reached 0.6-0.8. Expression of recombinant protein was induced by addition of IPTG 
(final concentration l.OmM) and the culture incubated for a further 3 hours. Bacteria were 
harvested by centrifugation at SOOOxg for 15 min at 4°C. 

The bacterial pellet was resuspended in 7.5ml of either (i) cold buffer A (300raM 
NaCl, 50mM phosphate buffer, lOmM imidazole, pH 8.0) for soluble proteins or (ii) buffer B 
(8M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8) for insoluble proteins. Cells 
were disrupted by sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 
and centrifuged at 13 OOOxg for 30 min at 4°C. For insoluble proteins, pellets were 
resuspended in 2.0 ml buffer C (6M guanidine hydrochloride, lOOmM phosphate buffer, . 
lOmM Tris-HCl, pH 7.5) and treated with a Bounce homogenizer for 10 cycles. The 
homogenate was centrifuged at 1 3 OOOxg for 40 min and the supernatant retained. 

Supematants for both soluble and insoluble preparations were mixed with 150|j1 Ni^*- 
resin (previously equiUbrated with either buffer A or buffer B, as appropriate) and incubated 
at room temperature with gentle agitation for 30 min. The resin was Chelating Sepharose Fast 
Flow (Pharmacia), prepared according to manufacturers protocol. The batch-wise preparation 
was centrifuged at 700xg for 5 min at 4°C and the supernatant discarded. The resin was 
washed twice (batch-wise) with 10ml buffer A or B for 10 min, resuspended in 1.0 ml buffer 
A or B and loaded onto a disposable column. The resin continued to be washed with either (i) 
buffer A at 4°C or (ii) buffer B at room temperature, until the ODzgonm of the flow-through 
reached 0.02-0.01. The resin was further washed with either (i) cold buffer C (300mM NaCl, 
SOmM phosphate buffer, 20mM imidazole, pH 8.0) or (ii) buffer D (8M urea, lOmM Tris- 
HCl, lOOmM phosphate buffer, pH 6.3) until the the ODzgonm of the flow-through reached 
0.02-0.01. The His-fusion protein was eluted by addition of 700^1 of either (i) cold elution 
buffer A (300mM NaCl, SOmM phosphate buffer, 250mM imidazole, pH 8.0) or (ii) elution 
buffer B (8 M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 4.5) and fractions 
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collected until the O.D280nm indicated all the recombinant protein was obtained. 20^1 aliquots 
of each elution fraction were analyzed by SDS-PAGE using a 12% gel. Protein concentrations 
were estimated using the Bradford assay. 

His-fusion proteins renaturation 

In the cases where denaturation was required to solubilize proteins, a renaturation step 
was employed prior to mununization. Glycerol was added to the denatured fractions obtained 
above to give a final concentration of 10%(v/v). The proteins were diluted to 200^g/ml using 
dialysis buffer I (10% (v/v) glycerol, 0.5M arginine, 50mM phosphate buffer, S.OmM reduced 
glutathione, 0.5mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same 
buffer for 12-14 hours at 4^*0. Further dialysis was performed with buffer U (10% (v/v) 
glycerol, 0.5M arginine, 50mM phosphate buffer, S.OmM reduced glutathione, O.SmM 
oxidised glutathione, pH 8.8) for 12-14 hours at 4**C. 

Alternatively, 10% glycerol was added to the denatured proteins. The proteins were 
then diluted to 20^g/ml using dialysis buffer I (10% glycerol, 0.5M arginine, 50mM 
phosphate buffer, 5mM reduced glutathione, 0.5mM oxidised glutathione, 2M urea, pH 8.8) 
and dialysed against the same buffer at 4°C for 12-14 hours. The protein was fiirther dialysed 
against dialysis buffer H (10% glycerol, 0.5M arginine, 50mM phosphate buffer, 5mM 
reduced glutathione, O.SmM oxidised glutathione, pH 8.8) for 12-14 hours at 4*'C. 

Protein concentration was evaluated using the formula: 

Protein (mg/ml) = (1.55 x OD280) - (0.76 x OD260) 

Purification of proteins 

To analyse the solubility, pellets obtained from 3.0ml cultures were resuspended in 
500^1 buffer Ml (PBS pH 7.2). 25^1 of lysozyme (lOmg/ml) was added and the bacteria 
incubated for 1 5 min at 4°C. Cells were disrupted by sonication on ice four times for 30 sec at 
40W using a Branson sonifier 450 and centrifiiged at 13 OOOxg for 30 min at 4°C. The 
supernatant was collected and the pellet resuspended in buffer M2 [8M urea, 0.5M NaCl, 
20mM imidazole and O.IM NaH2 PO4] and incubated for 3 to 4 hours at 4^C. After 
centrifiigation, the supernatant was collected and the pellet resuspended in buffer M3 [6M 
guanidinium-HCl, 0.5M NaCl, 20mM imidazole and 0.1MNaH2PO4] overnight at 4°C. The 
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supematants from all steps were analysed by SDS-PAGE. Some proteins were found to be 
soluble in PBS, others need urea or guanidium-HCl for solubilization. 

For preparative scale purifications, 500ml cultures were induced and fusion proteins 
solubilized in either buffer Ml, M2 or M3 using the procedure described above. Crude 
extracts were loaded onto a Ni-NTA superflow column (Quiagen) equilibrated with buffer 
Ml, M2 or M3 depending on the solubilization buffer employed. Unbound material was 
eluted by washing the column with the same buffer. The recombinant fusion protein was 
eluted with the corresponding buffer containing 500mM imidazole then dialysed against the 
same buffer in the absence of imidazole. 

Mice immunisations 

20|ag of each purified protein are used to immunise mice intraperitoneally . In the case 
of some ORFs, Balb-C mice were immunised with A1(0H)3 as adjuvant on days 1, 21 and 42, 
and immune response was monitored m samples taken on day 56. For other ORFs, CDl mice 
could be immunised using the same protocol For ORFs 25 and 40, CDl mice were 
immunised using Freund's adjuvant, and the same immunisation protocol was used, except 
that the immune response was measured on day 42, rather than 56. Similarly, for still other 
ORFs, CDl mice were immunised with Freund's adjuvant, but the immune response was 
measured on day 49. Alternatively, 20ng of each purified protein was mixed with Freund's 
adjuvant and used to immunise CDl mice intraperitoneally. For many of the proteins, the 
immunization was performed on days 1, 21 and 35, and immune response was monitored in 
samples taken on days 34 and 49. For some proteins, the third immunization was performed 
on day 28, rather than 35, and the immune response was measured on days 20 and 42, rather 
than 34 and 49. 

ELISA assay (sera analysis) 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37''C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 7ml of Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD620. The 
bacteria were let to grow until the OD reached the value of 0.3-0.4. The culture was 
centrifuged for 10 minutes at 10000 rpm. The supematant was discarded and bacteria were 
washed once with PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated 
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for 2 hours at room temperature and then ovemight at 4°C with stirring. lOOfil bacterial cells 
were added to each well of a 96 well Gremer plate and incubated ovemight at 4°C. The wells 
were then washed three times with PBT washing buffer (0. 1% Tween-20 in PBS). 200 ^1 of 
saturation buffer (2.7% Polyvmylpyrrolidone 10 in water) was added to each well and the 



diluted sera (Dilution buffer: 1% BSA, 0.1% Tween-20, 0.1% NaNs in PBS) were added to 
each well and the plates incubated for 90 minutes at 37°C. Wells were washed three times 
with PBT. 1 00 ixl of HRP-conjugated rabbit anti-mouse (Dako) serum diluted 1 :2000 in 
dilution buffer were added to each well and the plates were incubated for 90 minutes at 3TC. 
Wells were washed three times with PBT buffer. 100 lal of substrate buffer for HRP (25 ml of 
citrate buffer pH5, 10 mg of O-phenildiamine and 10 |al of H2O) were added to each well and 
the plates were left at room temperature for 20 minutes. 100 \xl H2SO4 was added to each 
well and OD490 was followed. The ELISA was considered positive when OD490 was 2.5 
times the respective pre-immune sera. 

Alternatively, The acapsulated MenB M7 strain was plated on chocolate agar plates 
and incubated ovemight at 37'*C. Bacterial colonies were collected from the agar plates using 
a sterile dracon swab and inoculated into Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD620. The bacteria 
were let to grow until the OD reached the value of 0.3-0.4. The culture was centrifiiged for 10 
minutes at 10 OOOrpm. The supernatant was discarded and bacteria were washed once with 
PBS, resuspended in PBS containing 0.025% fomialdehyde, and incubated for 1 hour at 37^*0 
and then ovemight at 4°C with stirring. 100^1 bacterial cells were added to each well of a 96 
well Greiner plate and incubated ovemight at 4°C. The wells were then washed three times 
with PBT washing buffer (0.1% Tween-20 in PBS). 200^1 of saturation buffer (2.7% 
Polyvinylpyrrolidone 10 in water) was added to each well and the plates incubated for 2 hours 
at 37''C. Wells were washed three times with PBT. 200^il of diluted sera (Dilution buffer: 1% 
BSA, 0. 1% Tween-20, 0.1% NaNa in PBS) were added to each well and the plates incubated 
for 2 hours at 37°C. Wells were washed three times with PBT. 100^1 of HRP-conjugated 
rabbit anti-mouse (Dako) serum diluted 1 :2000 in dilution buffer were added to each well and 
the plates were incubated for 90 minutes at 37°C. Wells were washed three times with PBT 
buffer. lOOfxl of substrate buffer for HRP (25ml of citrate buffer pH5, lOmg of O- 



plates incubated for 2 hours at Wells were washed three times with PBT. 200 \xl of 
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phenildiamine and 10^1 of H2O2) were added to each well and the plates were left at room 
temperature for 20 minutes. lOOjil of 12.5% H2SO4 was added to each well and OD490 was 
followed. The ELISA titers were calculated abitrarely as the dilution of sera which gave an 
OD490 value of 0.4 above the level of preimmune sera. The ELISA was considered positive 
when the dilution of sera with OD490 of 0.4 was higher than 1 :400. 

FACScan bacteria Binding Assay procedure. 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 4 tubes containing 8ml each Mueller-Hinton Broth (Difco) 
containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following 
OD620. The bacteria were let to grow until the OD reached the value of 0.35-0.5. The culture 
was centrifuged for 10 minutes at 4000rpm. The supernatant was discarded and the pellet was 
resuspended in blocking buffer (1% BSA in PBS, 0.4% NaNa) and centrifuged for 5 minutes 
at 4000rpm. Cells were resuspended in blocking buffer to reach OD620 of 0.07. 100[il bacterial 
cells were added to each well of a Costar 96 well plate. 1 00^1 of diluted ( 1 : 1 00, 1 :200, 1 :400) 
sera (in blocking buffer) were added to each well and plates incubated for 2 hours at 4^*0. 
Cells were centrifuged for 5 minutes at 4000rpm, the supernatant aspirated and cells washed 
by addition of 200|al/well of blocking buffer in each well. 100^x1 of R-Phicoerytrin conjugated 
F(ab)2 goat anti-mouse, diluted 1 : 100, was added to each well and plates incubated for 1 hour 
at 4'*C. Cells were spun down by centrifiigation at 4000rpm for 5 minutes and washed by 
addition of 200^1/well of blocking buffer. The supernatant was aspirated and cells 
resuspended in 200^1/weU of PBS, 0.25% fomialdehyde. Samples were transferred to 
FACScan tubes and read. The condition for FACScan (Laser Power 15mW) setting were: FL2 
on; FSC-H threshold:92; FSC PMT Voltage: E 01; SSC PMT: 474; Amp. Gains 6.1; FL-2 
PMT: 586; compensation values: 0. 

OMV preparations 

Bacteria were grown overnight on 5 GC plates, harvested with a loop and resuspended 
in 10 ml 20mM Tris-HCl. Heat inactivation was performed at 56''C for 30 minutes and the 
bacteria disrupted by sonication for 10' on ice ( 50% duty cycle, 50% output ). Unbroken 
cells were removed by centrifiigation at 5000g for 10 minutes and the total cell envelope 
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fraction recovered by centrifugation at 50000^ at 4X for 75 minutes. To extract cytoplasmic 
membrane proteins from the crude outer membranes, the whole fraction was resuspended in 
2% sarkosyl (Sigma) and incubated at room temperature for 20 minutes. The suspension was 
centrifuged at lOOOOg for 10 minutes to remove aggregates, and the supernatant further 
ultracentrifiiged at 50000^ for 75 minutes to pellet the outer membranes. The outer 
membranes were resuspended in lOmM Tris-HCl, pH8 and the protein concentration 
measured by the Bio-Rad Protein assay, using BSA as a standard. 

Whole Extracts preparation 

Bacteria were grown overnight on a GC plate, harvested with a loop and resuspended 
in 1ml of 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30' minutes. 

Western blotting 

Purified proteins (500ng/lane), outer membrane vesicles (5)ag) and total cell extracts 
(25|ig) derived from MenB strain 2996 were loaded onto a 12% SDS-polyacrylamide gel and 
transferred to a nitrocellulose membrane. The transfer was performed for 2 hours at 1 50mA at 
4*^0, using transfer buffer (0.3 % Tris base, 1 .44 % glycine, 20% (v/v) methanol). The 
membrane was saturated by overnight incubation at 4^*0 in saturation buffer (10% skimmed 
milk, 0.1% Triton XlOO in PBS). The membrane was washed twice with washing buffer (3% 
skimmed milk, 0.1% Triton XlOO in PBS) and incubated for 2 hours at 3TC with mice sera 
diluted 1 :200 in washing buffer. The membrane was washed twice and incubated for 90 
minutes with a 1 :2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The 
membrane was washed twice with 0.1% Triton XlOO in PBS and developed with the Opti- 
4CN Substrate Kit (Bio-Rad). The reaction was stopped by adding water. 

Bactericidal assay 

MC58 and 2996 strains were grown overnight at 37'*C on chocolate agar plates. 5-7 
colonies were collected and used to inoculate 7ml Mueller-Hinton broth. The suspension was 
incubated at 37°C on a nutator and let to grow until OD620 was in between 0.5-0.8. The 
culture was aliquoted into sterile 1.5ml Eppendorf tubes and centrifuged for 20 minutes at 
maximum speed in a microfuge. The pellet was washed once in Gey's buffer (Gibco) and 
resuspended in the same buffer to an OD620 of 0.5, diluted 1 :20000 in Gey's buffer and stored 
at 25''C. 
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50pl of Gey's bufrer/1% BSA was added to each well of a 96-well tissue culture plate. 
25\xl of diluted (1 :100) mice sera (dilution buffer: Gey's bufifer/0.2% BSA) were added to 
each well and the plate incubated at 4°C. 25)al of the previously described bacterial 
suspension were added to each well. 25)al of either heat-inactivated (56°C waterbath for 30 
minutes) or normal baby rabbit complement were added to each well. Immediately after the 
addition of the baby rabbit complement, 22^1 of each sample/well were plated on Mueller- 
Hinton agar plates (time 0). The 96-well plate was incubated for 1 hour at 37°C with rotation 
and then 22)al of each sample/well were plated on Mueller-Hinton agar plates (time 1). After 
overnight incubation the colonies corresponding to time 0 and time Ih were counted. 

Gene Variability 

The OItF4 and 919 genes were amplified by PGR on chromosomal DNA extracted 
from various Neisseria strains (see list of strains). The following oligonucleotides used as 
PGR primers were designed in the upstream and downstream regions of the genes: 

orf4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 

919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 
919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

The PGR cycling conditions were: 

1 cycle 2 min. at 94° 

30 cycles 30 sec, at 94° 

30 sec. at - 54° or - 60° ( in according to Tm of the primers) 

40 sec. at 72° 
1 cycle 7 min. at 72° 

The PGR products were purified fi-om 1 % agarose gel and sequenced using the 
following primers: 

orf 4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf 4.2 (forward) CGACCGCGCCTTTGGGACTG 
orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 
orf 4.4 (reverse) TCTTTGAGTTTGATCCAACC 
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919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 

919.2 (forward) ATCCTTCCGCCTCGGCTGCG 

919.3 (forward) AAAACAGCGGCACAATCGAC 

919.4 (forward) ATAAGGGCTACCTCAAACTC 

919.5 (forward) GCGCGTGGATTATTTTTGGG 

919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

919.7 (reverse) CCCAAGGTAATGTAGTGCCG 

919.8 (reverse) TAAAAAAAAGTTCGACAGGG 

919.9 (reverse) CCGTCCGCCTGTCGTCGCCC 

919.10 (reverse) TCGTTCCGGCGGGGTCGGGG 

All documents cited herein are incorporated by reference in their entireties. 
The following Examples are presented to illustrate, not limit, the invention 

EXAMPLE 1 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PGR) assay in order to clone the OKFs as 
indicated: 



Table 1: Oligonucleotides used for PGR for Examples 2-10 



ORF 


Primer 


Sequence 


Restriction sites 


279 


Forward 


CGCGGATCCCATATG-TTGCCTGCAATCACGATT 
<SEQID3021> 


BamHI-Ndel 




Reverse 


CCCGCTCGAG-TTTAGAAGCGGGCGGCAA <SFO 
ID 3022> 


Xhol 


519 


Forward 


CGCGGATCCCATATG-TTCAAATCCTTTGTCGTnA 
<SEQ ID3023> 


BamHI-Ndel 




Reverse 


CCCGCTCGAG-TTTGGCGG ITTTGCTGC <SEQ ID 
3024> 


Xhol 


576 


Forward 


CGCGGATCCCATATG-GCCGCCCCCGCATnT 
<SEQ1D3025> 


BamHI-Ndel 




Reverse 


CCCGCTCGAG-ATTTACTTTTTTGATGTCGAC 
<SEQ ID 3026> 


Xhol 


919 


Forward 


CGCGGATCCCATATG-TGCCAAAGCAAGACnATr: 
<SEQ ID 3027> 


BamHI-Ndel 




Reverse 


CCCGCTCGAG-CGGGCGGTATTCGGG <SEQ ID 
3028> 


Xhol 


121 


Forward 


CGCGGATCCCATATG-GAAACACAGCTTTAr.AT 
<SEQ ID3029> 


BamHI-Ndel 
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Reverse 


CCCGCTCGAG-ATAATAATATCCCGCGCCC <SEQ 
ID 3030> 


Xhol 


128 


Forward 


CGCGGATCCCATATG-ACTGACAACGCACT <SEO 
ID 3031 > 


BamHI-Ndel 




Reverse 


CCCGCTCGAG-GACCGCGTTGTCGAAA <SEQ ID 
3032> 


Xhol 


206 


Forward 


CGCGGATCCCATATG-AAACACCGCCAACCGA 
<SEQ ID 3033> 


BamHI-Ndel 




Reverse 


CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC 
<SEQ ID 3034> 


Xhol 


287 


Forward 


CCGGAATTCTAGCTAGC-CTTTCAGCCTGCGGG 
<SEQ ID 3035> 


EcoRI-Nhel 




Reverse 


CCCGCTCGAG-ATCCTGCTC 1 1 1 1 1 IGCC <SEO in 
3036> 


Xhol 


406 


Forward 


CGCGGATCCCATATG-TGCGGGACACTGACAG 
<SEQ ID 3037> 


BamHI-Ndel 




Reverse 


CCCGGTCGAG-AGGTTGTCCTTGTCTATG <SEO 
ID 3038> 


Xhol 



Localization of the ORFs 

The following DNA and amino acid sequences are identified by titles of the following 
form: [g, m, or a] [#].[seq or pep], where "g" means a sequence from N. gonorrhoeae, "m" 
means a sequence fi-omA^. meningitidis B, and "a" means a sequence fi-om N. meningitidis A; 
"#" means the number of the sequence; "seq" means a DNA sequence, and "pep" means an 
amino acid sequence. For example, "gOOl.seq" refers to an N. gonorrohoeae DNA sequence, 
number 1. The presence of the suffix "-1" to these sequences indicates an additional sequence 
found for the same ORF, thus, data for an ORF having both an unsuffixed and a suffixed 
sequence designation applies to both such designated sequences. Further, open reading 
fi-ames are identified as ORF #, where "#" means the number of the ORF, corresponding to 
the number of the sequence which encodes the ORF, and the ORF designations may be 
suffixed with ".ng" or ".a", indicating that the ORF corresponds to a AT. gonorrhoeae sequence 
or meningitidis A sequence, respectively. The word "partial" before a sequence indicates 
that the sequence may be a partial or a complete ORF. Computer analysis was performed for 
the comparisons that follow between "g", "m", and "a" peptide sequences; and therein the 
"pep" suffix is implied where not expressly stated. Further, in the event of a conflict between 
the text immediately preceding and describing which sequences are being compared, and the 
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designated sequences being compared, the designated sequence controls and is the actual 
sequence being compared. 

ORF: contig: 



The following partial DNA sequence was identified in A^, meningitidis <SEQ ID 3039>: 
m279.seq 

1 ATAACGCGGA TTTGCGGCTG CTTGATTTCA ACGGTTTTCA GGGCTTCGGC 
51 AAGTTTGTCG GCGGCGGGTT TCATCAGGCT GCAATGGGAA GGTACGGACA 
101 CGGGCAGCGG CAGGGCGCGT TTGGCACCGG CTTCTTTGGC GGCAGCCATG 
151 GCGCGTCCGA CGGCGGCGGC GTTGCCTGCA ATCACGATTT GTCCGGGTGA 
201 GTTGAAGTTG ACGGCTTCGA CCACTTCGCT TTGGGCGGCT TCGGCACAAA 
251 TGGCTTTAAC CTGCTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 
3 01 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA GGCGCACGAG 
351 TTTGACCGCG TCGGCAAAAT TC7ATGCGCC GGCGGCAACG AGTGCGGTGT 
401 ATTCGCCGAG GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 
451 TCTAAATAG 

This corresponds to the amino acid sequence <SEQ ID 3040; ORF 279>: 
m279.pep 

1 ITRICGCLIS TVFRASASLS AAGFIRLQWE GTDTGSGRAR lAPASLAAAM 
51 ARPTAAA LPA ITICPGELKL TASTTSLWAA SAQMALTCSS SKPRIAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKFNAPAAT SAVYSPRLCP ATAAGVLPPA 
151 SK* 

The following partial DNA sequence was identified mN. gonorrhoeae <SEQ ID 304 1>: 

g279.seq 

1 atgacgcgga tttgcggctg cttgatttca acggttttga gtgtttcggc 
51 aagtttgtcg gcggcgggtt tcatcaggct gcaatgggaa ggaacggata 
101 ccggcagcgg cagggcgcgt ttggctccgg cttctttggc ggcagccatg 
151 gtgcgtccga cggcggcggc gttgcctgca atcacgactt gtccgggcga 
201 gttgaagttg acggcttcga ccacttcgcc ctgtgcggat tcggcacaaa 
251 tctgcctgac ctgttcatct tccaaaccca aaatggccgc cattgcgcct 
3 01 acgccttgcg gtacggcgga ctgcatcagt tcggcgcgca ggcggacgag 
351 tttgacggca tcggcaaaat ccaatgcttc ggcggcgaca agcgcggtgt 
401 attcgccgag gctgtgtccg gcaacggcgg caggcgtttt gccgcccact 
451 tccaaatag 

This corresponds to the amino acid sequence <SEQ ID 3042; ORF 279.ng>: 
g279.pep 

1 MTRICGCLIS TVLSVSASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 
51 VRPTAAA LPA ITTCPGELKL TASTTSPCAD SAQICLTCSS SKPKMAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKSNASAAT SAVYSPRLCP ATAAGVLPPT 
151 SK* 

ORF 279 shows 89.5% identity over a 152 aa overlap with a predicted ORF (ORF 279.nj 
fi*om N, gonorrhoeae: 



279 



gnm4.seq 



g279 



m2 79 .pep 
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70 80 90 100 110 120 

m279.pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 

II lllllllllllll I III: II :: ||||||| 

9279 ITTCPGELKLTASTTSPCADSAQI CLTCSSSKPKMAAI APTPCGTADCI SS ARRRTSLTA 

70 80 90 100 110 120 

130 140 150 

m279 . pep SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX 

III II iiiiiiiiiiiiiiiiiiiiihiii 

g2 7 9 SAKSNASAATSAVYSPRLCPATAAGVLPPTSKX 
130 140 150 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 3043>: 

a279.seq 

1 ATGACNCNGA TTTGCGGCTG CTTGATTTCA ACGGTTTNNA GGGCTTCGGC 

51 GAGTTTGTCG GCGGCGGGTT TCATGAGGCT GCAATGGGAA GGTACNGACA 

101 C^5GGCAGCGG CAGGGCGCGT TTGGCGCCGG CTTCTTTGGC GGCAAGCATA 

151 GCGCGCTCGA CGGCGGCGGC ATTGCCTGCA ATCACGACTT GTCCGGGCGA 

201 GTTGAAGTTG ACGGCTTCAA CCACTTCATC CTGTGCGGAT TCGGCGCAAA 

251 TTTGTTTTAC CTGTTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA NGCGCACGAG 

351 TTTGACCGCG TCGGCAAAAT CCAATGCGCC GGCGGCAACN AGTGCGGTGT 

401 ATTCGCCGAN GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

451 TCCGAATAG 

This corresponds to the amino acid sequence <SEQ ID 3044; ORF 279.a>: 

a279.pep 

1 MTXXCGCLIS TVXRASASLS AAGFMRLQWE GTDTGSGRAR LAPASLAASI 
51 ARSTAAALPA ITTCPGELKL TASTTSSCAD SAQICFTCSS SKPRIAAIAP 
101 TPCGTADCIS SARXRTSLTA SAKSNAPAAT SAVYSPXLCP ATAAGVLPPA 
151 SE* 

m279/a279 ORFs 279 and 279.a showed a 88.2% identity in 1 52 aa overly 

10 20 30 40 50 60 

m279.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

'I MIMMM illlMillll:lllillllllllll|||||||||::|| 
a279 MTXICGCLISTVXRASASLSAAGFMRLQWEGTDTGSGRARLAPASLAASIARSTAAALPA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m27 9 . pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 
I! IIMIIIIIIIII I III: :illllllll||||i||||||i||i||| |||||| 
ITTCPGELKLTASTTSSCADSAQICFTCSSSKPRIAAIAPTPCGTADCISSARXRTSLTA 
70 80 90 100 110 120 



a279 



130 140 150 

m27 9 . pep SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX 
Ml llllllllllll lllllllllllll|:| 
a27 9 SAKSNAPAATSAVYSPXLCPATAAGVLPPASEX 

130 140 150 



519 and 519-1 gnm7.seq 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 3045>: 

m519.seq (partial) 

1 , .TCCGTTATCG GGCGTATGGA GTTGGACAAA ACGTTTGAAG AACGCGACGA 

51 AATCAACAGT ACTGTTGTTG CGGCTTTGGA CGAGGCGGCC GGGgCTTgGG 

101 GTGTGAAGGT TTTGCGTTAT GAGATTAAAG ACTTGGTTCC GCCGCAAGAA 

151 ATCCTTCGCT CAATGCAGGC GCAAATTACT GCCGAACGCG AAAAACGCGC 

201 CCGTATCGCC GAATCCGAAG GTCGTAAAAT CGAACAAATC AACCTTGCCA 



m 
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251 GTGGTCAGCG CGAAGCCGAA 

301 GCGGTCAATG CGTCAAATGC 

351 AGGTGAAGCG GAATCCTTGC 

401 TCCGTCAAAT TGCCGCCGCC 

451 AATCTGAAGA TTGCGGAACA 

501 AGAAAGCAAT ACGCTGATTA 

551 TGATTTCTGC CGGTATGAAA 



ATCCAACAAT CCGAAGGCGA GGCTCAGGCT 
CGAGAAAATC GCCCGCATCA ACCGCGCCAA 
GCCTTGTTGC CGAAGCCAAT GCCGAAGCCA 
CTTCAAACCC AAGGCGGTGC GGATGCGGTC 
ATACGTCGCT GCGTTCAACA ATCTTGCCAA 
TGCCCGCCAA TGTTGCCGAC ATCGGCAGCC 
ATTATCGACA GCAGCAAAAC CGCCAAaTAA 



This corresponds to the amino acid sequence <SEQ ID 3046; ORF 519>: 

m5l9.pep (partial) 

1 ..SVIGRMELDK TFEERDEINS TWAALDEAA GAWGVKVLRY EIKDLVPPQE 

51 ILRSMQAQIT AEREKRARIA ESEGRKIEQI NLASGQREAE IQQSEGEAQA 

101 AVNASNAEKI ARINRAKGEA ESLRLVAEAN AEAIRQIAAA LQTQGGADAV 

151 NLKIAEQYVA AFNNLAKESN TLIMPANVAD IGSLISAGMK IIDSSKTAK* 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 3047>: 

g519 . seq 

1 atggaatttt tcattatctt gttggcagcc gtcgccgttt tcggcttcaa 

51 atcctttgtc gtcatccccc agcaggaagt ccacgttgtc gaaaggctcg 

101 ggcgtttcca tcgcgccctg acggccggtt tgaatatttt gattcccttt 

151 atcgaccgcg tcgcctaccg ccattcgctg aaagaaatcc ctttagacgt 

201 acccagccag gtctgcatca cgcgcgataa tacgcaattg actgttgacg 

251 gcatcatcta tttccaagta accgatccca aactcgcctc atacggttcg 

3 01 agcaactaca ttatggcaat tacccagctt gcccaaacga cgctgcgttc 
351 cgttatcggg cgtatggagt tggacaaaac gtttgaagaa cgcgacgaaa 

4 01 tcaacagtac cgtcgtctcc gccctcgatg aagccgccgg ggcttggggt 
451 gtgaaagtcc tccgttacga aatcaaggat ttggttccgc cgcaagaaat 
501 ccttcgcgca atgcaggcac aaattaccgc cgaacgcgaa aaacgcgccc 
551 gtattgccga atccgaaggc cgtaaaatcg aacaaatcaa ccttgccagt 
601 ggtcagcgtg aagccgaaat ccaacaatcc gaaggcgagg ctcaggctgc 
651 ggtcaatgcg tccaatgccg agaaaatcgc ccgcatcaac cgcgccaaag 
701 gcgaagcgga atccctgcgc cttgttgccg aagccaatgc cgaagccaac 
751 cgtcaaattg ccgccgccct tcaaacccaa agcggggcgg atgcggtcaa 
801 tctgaagatt gcgggacaat acgttaccgc gttcaaaaat cttgccaaag 
851 aagacaatac gcggattaag cccgccaagg ttgccgaaat cgggaaccct 
901 aattttcggc ggcatgaaaa attttcgcca gaagcaaaaa cggccaaata 
951 a 

This corresponds to the amino acid sequence <SEQ ID 3048; ORF 519.ng>: 
g5l9 .pep 

1 MEFFIILLAA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDET^GAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

201 . GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESIiR LVAEANAEAN 

251 RQIAAALQTQ SGADAVNLKI AGQYVTAPKN LAKEDNTRIK PAKVAEIGNP 

301 NPRRHEKFSP EAKTAK* 



ORF 519 shows 87.5% identity over a 200 aa overlap with a predicted ORF (ORF 519.ng) 
fi*om N, gonorrhoeae: 

m519/g519 

10 20 30 

m519 .pep SVIGRMELDKTFEERDEINSTWAALDEAA 

Mllllllllllllllllllllhllllll 

g519 YFQVTDPKIASYGSSNYIMAITQIAQTTUISVIGRMELDKTFEERDEINSTVVSALDEAA 
90 100 110 120 130 140 



m5 1 9 . pep 



40 50 60 70 80 90 

GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
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IIIIMIIIIIIIIII hllllllllM IIIIIIIIIIIMIIIIIIII 

g519 GAWGVKVUlYEIKDLVPPQEILRAMQAQITAEREKRARIAESEGRKIEQlNIiASGQREAE 
150 160 170 180 190 200 

100 110 120 130 140 150 

m5 1 9 . pep IQQSEGEAQAAVNASNAEKI ARINRAKGEAESLRLVAEANAEAIRQI A/^LQTQGGADAV 

IIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII lllllllllhlllll 

g519 IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEANRQIAAALQTQSGADAV 
210 220 230 240 250 260 

160 170 180 190 200 

m519 .pep NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL-ISAGMKIIDSSKTAK 
Mill llhlhllllhll I Ihlhll: : |: :|MI 

g519 NLKIAGQYVTAFKNLAKEDNTRIKPAKVAEIGNPNFRRHEKFSPEAKTAK 
270 280 290 300 310 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 3049>: 

a519. seq 

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

401 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

451 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

7 01 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3050; ORF 519.a>: 

a519.pep 

1 MEFFIILLAA VWFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREMIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



m519/a519 ORFs 519 and 519. a showed a 99.5% identity in 199 aa overlap 

10 20 30 

m51 9 . pep SVIGRMELDKTFEERDEINSTWAALDEAA 

M I I I I I I t I I I I I I I I I I I I I I : I I I I I I 
a519 YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIGRMELDKTFEERDEINSTWSALDEAA 
90 100 110 120 130 140 

40 50 60 70 80 90 

m519.pep GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
■IIIIIIIIIIMIIIIIIIItlllllllllllllllllllMiiiMIIIIIIIIIIM 
a519 GAWGVKVLRYEIKDLVPPQEILRSMQAQXTAEREKRARIAESEGRKIEQINLASGQREAE 
150 160 170 180 190 200 



100 110 120 130 140 
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mSl 9 . pep IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 

ItlllllllMIIMMIMIII llllllllllilllllllllllllllMII 

a519 IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
210 220 230 240 250 260 

160 170 180 190 200 

m5l9 .pep NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX 
IMIiMMMIMMIIIIIItljlllllllllllllllllllMliii 
a519 NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX 
270 280 290 300 310 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3051>: 

m519-l . seq 

1 ATGGAATTTT TCATTATCTT GTTGGTAGCC GTCGCCGTTT TCGGTTTCAA 

51 ATCCTTTGTT GTCATCCCAC AACAGGAAGT CCACGTTGTC GA7VAGGCTGG 

101 GGCGTTTCCA TCGCGCCCTG ACGGcCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GCATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

401 TCAACAGTAC TGTTGTTGCG GCTTTGGACG AGGCGGCCGG GGCTTGGGGT 

451 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC CGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCTGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3052; ORF 519-1>: 

m519-l. 

1 MEFFIILLVA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWA ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 

The following DNA sequence was identified in K gonorrhoeae <SEQ ID 3053>: 

g519-l.seq 

1 ATGGAATTTT TCATTATCTT GTTGGCAGCC GTCGCCGTTT TCGGCTTCAA 

51 ATCCTTTGTC GTCATCCCCC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGATAA TACGCAATTG ACTGTTGACG 

251 GCATCATCTA TTTCCAAGTA ACCGATCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCAAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

401 TCAACAGTAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG GGCTTGGGGT 

451 GTGAAAGTCC TCCGTTACGA AATCAAGGAT TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCGCA ATGCAGGCAC AAATTACCGC CGAACGCGAA AAACGCGCCC 

551 GTATTGCCGA ATCCGAAGGC CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGTG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCCAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GCGAAGCGGA ATCCCTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGGGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTAGCCGC GTTCAACAAT CTTGCCAAAG 
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851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GCATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3054; ORF 519-l.ng>: 

g519-l.pep 

1 t4EFFIILLAA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQXTAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



iaS19-l/g519-l ORFs 519-1 and 519-1. ng showed a 99.0% identity in 315 aa 
overlap 

10 20 30 40 50 60 

g519-l.pep MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
lit IIIIMIIMIIMIIIIiMltllllllllllliiiiiiii Hill Mil II 11,1 

m5l9-l MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
10 20 30 40 50 60 

70 80 90 100 110 120 

g519-l.pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKIASYGSSNYIMAITQLAQTTLRSVIG 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II n 1 1 1 1 1 1 1 1 

m519-l KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
70 80 90 100 110 120 

130 140 150 160 170 180 

g519-l,pep rmeldktfeerdeinstwsaldeaagawgvkvlryeikdlvppqeilramqaqitaere 

llllllllinil)lllll:lllltllllltlll|||||||j|tltlt|:||||l|l||| 

rmeldktfeerdeinstwaaldeaagawgvkvlryeikdlvppqeilrsmqaqitaere 

130 140 150 160 170 180 



m519-l 



190 200 210 220 230 240 

g5i9-i.pep krariaesegrkieqinlasgqreaeiqqsegeaqaavnasnaekiarinrakgeaeslr 
llllllllllllllltlllMllllllllltlllllilllllllllMIIMIIIIIIII 
m5i 9- 1 krariaesegrkieqinlasgqreaeiqqsegeaqaavnasnaekiarinrakgeaeslr 

190 200 210 220 230 240 

250 260 270 280 290 300 

g51 9-1 . pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I I N I M II I I II I I I II I I II I I I I I I I I I I II I I I I I I I I II I I I I I II 11 I , II II , 

m519-l lvaeanaeairqiaaalqtqggadavnlkiaeqyvaafnnlakesntlimpanvadigsl 

250 260 270 280 290 300 

310 

g519-l.pep ISAGMKIIDSSKTAKX 
II t I I I I I I II I I I I I 
m5 1 9 - 1 I S AGMK 1 1 DS SKTAKX 

310 

The following DNA sequence was identified in A^. meningitidis <SEQ ID 3055>: 

a519-l.seq 



1 


ATGGAATTTT 


51 


ATCCTTTGTT 


101 


GGCGTTTCCA 


151 


ATCGACCGCG 


201 


ACCCAGCCAG 


251 


GTATCATCTA 


301 


AGCAACTACA 


351 


CGTTATCGGG 
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4 01 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3056; ORF 519-La>: 

a519-l.pep. 

1 MEFFIILLAA WVHX5 FKSFV . VIPOOEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 

m519-l/a519-l ORFs 519-1 and 519-1. a showed a 99.0% identity in 315 aa 

overlap 

10 20 30 40 50 60 

a519-l.pep MEFFIILLAAVWPGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
I I I I i I I I : I I : I I I I I I I I I I t I I I I I I I I t I i I } i { I I I I i I I I t I I I I I i I I I I I I I 
m519-l MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
10 20 30 40 50 60 , 

70 80 90 100 110 120 

a519-l.pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
MMIIMIIIIIIIIIIIINIIIIIItllllllllllllllliNMIIIIIMIIII 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAXTQLAQTTLRSVIG 
70 80 90 100 110 120 



in519-l 



130 140 150 160 170 180 

aSl 9-1 . pep RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

lltllllllllllllMlhllllllMIMii I I I I t I t I I I I I I I I I I j I I I 

RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
130 140 150 160 170 180 



m519-l 



190 200 210 220 230 240 

a519-l . pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
I I I I I I I I t I I I I I I I I I I I I I i M I I I i I I I I I I I I I I I I I I I I I I I ] I I I I I I I I I I I 

KRARIAESEGRKIEQI^3LASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
190 200 210 220 230 240 



m519-l 



250 260 270 280 290 300 

a519-l.pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNN LAKESNTLIM PANVADIGSL 

INIIItlllllll llllllllltllllllllllMIIIIMIMIIIIItllll 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
250 260 270 280 290 300 



m519-l 



310 

a519-l.pep I S AGMKI IDS SKT AKX 
I I M I I I I I I I I I I M 
m519-l ISAGMKIIDSSKTAKX 
310 
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The following partial DNA sequence was identified in N, meningitidis <SEQ ID 3057>: 

m576.seq.. (partial) 

1 ..ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

51 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

101 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

151 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

201 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

251 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

301 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

351 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

401 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

4 51 GTGATTCCGG GTTGGACCGA AGgCGTACAG CTTCTGAAAG AAGGCGGCGA 

501 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

551 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

601 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

651 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3058; ORF 576>: 

m576.pep. . (partial) 

1 . .MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 
51 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 
101 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 
151 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 
201 KIGAPENAPA KQPAQVDIKK VN* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3059>: 

g576.3eq. . (partial) 

acatcggacg ctccctgaaa caaatgaagg aacagggcgc 
ttgaaagtct ttaccgatgc catgcaggca gtgtatgacg 
caaaatgacc gaagagcagg cccaggaagt gatgatgaaa 
agcagcaggc taaagccgta gaaaaacaca aggcggatgc 
aaagaaaaag gcgaagcctt cctgaaggaa aatgccgccg 
gaagaccact gcttccggtc tgcagtacaa aatcaccaaa 
gcaaacagcc gacaaaagac gacatcgtta ccgtggaata 
ctgattgacg gtaccgtatt cgacagcagc aaagccaacg 
caccttccct ttgagccaag tgattccggg ttggaccgaa 
ttctgaaaga aggcggcgaa gccacgttct acatcccgtc 
taccgcgaac agggtgcggg cgaaaaaatc ggtccgaacg 
atttgacgtg aaactggtca aaatcggcgc acccgaaaac 
agcagccgga tcaagtcgac atcaaaaaag taaattaa 

This corresponds to the amino acid sequence <SEQ ID 3060; ORF 576.ng>: 

g576.pep. . (partial) 

1 ..MGVDIGRSLK QMKEQGAEID LKVFTDAMQA VYDGKEIKMT EEQAQEVMMK 
51 FLQEQQAKAV EKHKADAKAN KEKGEAFLKE NAAEDGVKTT ASGLQYKITK 
101 QGEGKQPTKD DIVTVEYEGR LIDGTVFDSS KANGGPATFP LSQVIPGWTE 
151 GVRLLKEGGE ATFYIPSNLA YREQGAGEKI GPNATLVFDV KLVKIGAPEN 
201 APAKQPDQVD IKKVN* 



1 


. .atgggcgtgg 


51 


ggaaatcgat 


101 


gcaaagaaat 


151 


ttcctgcagg 


201 


gaaggccaac 


251 


aagacggcgt 


301 


cagggtgaag 


351 


cgaaggccgc 


401 


gcggcccggc 


451 


ggcgtacggc 


501 


caaccttgcc 


551 


ccactttggt 


601 


gcgcccgcca 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

m576/g576 ORFs 576 and 57 6, ng showed a 97.2% identity in 215 aa overlap 

10 20 30 40 50 60 

m57 6 . pep MQQASYAMGVDIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQ 
>> X I < I > I t < I I I I I I I I I I I I I I : I I I I I I I I I I I [ I I I I I I I I I 
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g57 6 MGVDIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVMMKFLQ 

10 20 30 40 50 

70 80 90 100 110 120 

m57 6 . pep EQQAKAVEKHKADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIV 

IINIIIIIIIIIItlllllllllllMlhllllllllllillllili Mill 

g57 6 EQQAKAVEKHKADAKANKEKGEAFLKENAAEDGVKTTASGLQYKITKQGEGKQPTKDDIV 
60 70 80 90 100 110 

130 140 150 160 170 180 

m576 . pep TVEYEGRLIDGTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYRE 

III) Illllllllhillllllllllllll: Ill iitl 

g57 6 TVEYEGRLIDGTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEATFYIPSNLAYRE 
120 130 140 150 160 170 

190 200 210 220 

m57 6 . pep QGAGDKIGPNATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 
i I I I I I I i I I I I I I I I I I I I I I I II M I I I II I I I I I I I II 
g57 6 QGAGEKIGPNATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 
180 190 200 210 

The following partial DNA sequence was identified inN, meningitidis <SEQ ID 306 1>: 

a57 6.seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

401 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

451 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCG7VA CAGGGTGCGG 

701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAT^ GTAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 3062; ORF 576.a>: 

a57 6.pep 

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 



m576/a576 ORFs 576 and 576, a showed a 99.5% identity in 222 aa overlap 

10 20 30 

m57 6 . pep MQQAS YAMGVDIGRSLKQMKEQGAEI DLKV 

I I I I I M I I I I I I I II I M I I I I I I I I I j I 
a576 CGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGVDIGRSLKQMKEQGAEIDLKV 
30 40 50 60 70 80 

40 50 60 70 80 90 

m57 6 . pep FTEAMQAVYDGKE IKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 

>> I I I II I I II I I I II I I I I I I M I I I I I I I II I I I I I I I II 

a57 6 ^EIAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 

90 100 110 120 130 140 
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100 110 120 130 140 150 

m576.pep KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLIDGTVFDSSKANGGPVTFPLSQ 

tIMIIIIItllMIIIIIIIIIIIIIMIIIIIIIIillllltll Illllllll 

a576 KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLIDGTVFDSSKANGGPVTFPLSQ 
ISO 160 170 180 190 200 

160 170 180 190 200 210 

m57 6 . pep VIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 
If > f I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I ] I I I I I I I j I ,,, , 
a57 6 VILGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

210 220 230 240 250 260 

220 

m57 6 . pep KQPAQVDIKKVNX 
I I I I I I I I I I I I I 
a57 6 KQPAQVDIKKVNX 
270 



Further work revealed the DNA sequence identified in N, meningitidis <SEQ ID 3063>; 

in576-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAiU\AA GGCGAAGCCT 

401 TTCTGAAAGA 7VAATGCCGCC AAAGACGGCG TG7VAGACCAC TGCTTCCGGC 

451 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3064; ORF 576-l>: 

m57 6-l.pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

The following DNA sequence was identified in A^. gonorrhoeae <SEQ ID 3065>: 

g576-l. seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTGCCGCG CAGGGCGACA CCTCTTCAAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC AATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 ACAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGATG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCCCAGGAAG TGATGATGAA ATTCCTGCAG GAGCAGCAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGATG CGAAGGCCAA CAAAGAAAAA GGCGAAGCCT 

401 TCCTGAAGGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGT 

451 CTGCAGTACA AAATCACCAA ACAGGGTGAA GGCAAACAGC CGACAAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACCGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG CCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACGG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
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7 01 GCGAAAAAAT CGGTCCGAAC GCCACTTTGG TATTTGACGT GAAACTGGTC 
751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG ATCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3066; ORF 576-1. ng>: 

g576-l .pep 

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASAA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTDAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPATFPLSQ 

201 VIPGWTEGVR LLKEGGEATF YIPSNLAYRE QGAGEKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPDQVDIKK VN* 



g576-l/iu576-l ORFs 576-1 and 57 6-l.ng showed a 97.8% identity in 272 aa 

overlap 

10 20 30 40 50 60 

g57 6-1 . pep MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASAAQGDTSSIGSTMQQASYAMGV 

II Mill I lllltllllMIIIIII Illllllll 

m576-l MNTIFKI S ALTLS AALALS ACGKKE AAPAS ASE PAAAS S AQG DTS S I G S TMQQAS YAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

g57 6-1 . pep DIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 

mil I IMIIhllllllllllllliiitlllillMlllillllMll 

DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 
"70 80 90 100 110 120 



m57 6-l 



130. 140 150 160 170 180 

g57 6-1 . pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 

> > I I I I I I I I I I I I I I I t I i I I I I I I I I I I I I I I I I I I I I I 

KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
130 140 150 160 170 180 



m57 6-l 



190 200 210 220 230 240 

g57 6-1 . pep GTVFDSSKANGGPATFPLSQV I PGWTEGVRLLKEGGEATFYIPSNLAYRE QGAGEKIGPN 

I < I I > > I t I M M •- M I I I M I I I : I I I I I I I I I I I t I M I I I I I I I I I : I I I I I 

GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 
190 200 210 220 230 240 



in57 6-l 



250 260 270 

g57 6-1 . pep ATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 

I I I I I M I I I I I [Ill Illllllll 

m57 6-1 ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 

250 260 270 

The following DNA sequence was identified in A^. meningitidis <SEQ ID 3067>: 

a57 6-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

401 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 



# 
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701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 
751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 
801 CATCAA7\AAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3068; ORF 576-1 .a>: 

a57 6-l.pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

a576-l/m576-l ORFs 57 6-1 and 57 6-1. a showed a 99.6% identity in 272 

overlap 

10 20 30 40 50 60 

a576-l . pep MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV 
■■llltllltllllllllilllllMllltlMlllllllllllltllilMiMii j|| 
m576-l MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

a57 6-1 . pep DIGRSLKQMKEQGAEIDLKVFTEAMQAVYCGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 
M I I I I I I I I I I I I I I I I I I I I I I I t I I I I I i I I I I I I I i I I I I I M I I I I I I I I I I I I I 

DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 
70 80 90 100 110 120 



m57 6-l 



130 140 150 160 170 180 
a57 6-1 . pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
I I i I I M I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
130 140 150 160 170 IBO 



m57 6-l 



190 200 210 220 230 240 
a57 6-l . pep GTVFDSSKANGGPVTFPLSQVILGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 
OlMIIMIIIIflllllMI llllllltlllllllllllMIII II 

GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 
190 200 210 220 230 240 



m57 6-l 



250 260 270 

a57 6-1 . pep ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 
i I I I I I I 1 I I I I I I I I I II I I II I I I t t II I II 
m57 6-1 ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 

250 260 270 



919 gmn43.seq 



The following partial DNA sequence was identified in Kmeningitidis <SEQ ID 3069>: 

m919.seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TACGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTC TATACCGTTG TACCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCTTTCAGG CAAAACAGTT 

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

401 CCGGTACGGT TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGACAGG 



wo 99/57280 




PCT/US99/09346 



84 



451 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCX3 ACGATTTTAT 

501 CTCCGTCCCC CTGCCTGCCX3 GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

601 CATACCGCCG ACCTCTCCcG ATTCCCCATC ACCGCGCGCA CAACAGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

751 GAAGACCCTG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

851 AACATCCyTA CGTTTCCATC GGACGCTATA TGGCGGATAA GGGCTACCTC 

901 AAACTCGGAC AAACCTCCAT GCT^GGGCATT AAGTCTTATA TGCGGCAAAA 

951 TCCX3CAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCCGCTGA TGGGGGAATA TGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGTGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCX3ACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTC CTACCCAACG 

1301 GTATGAAGCC CGAATACCGc CCGTAA 



This corresponds to the amino acid sequence <SEQ ID 3070; ORF 91 9>: 

m919 .pep 

1 MKKYLFRAAL YGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SFQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSRFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KliGQTSMQGI KSYMRQNPQR LAEVLGQNPS YIFFRELAGS SNDGPVGALG 

3 51 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 307 1>: 

g919 . seq 

1 ATGAAAAAAC ACCTGCTCCG CTCCGCCCTG TACGGcatCG CCGCCgccAT 

51 CctcgCCGCC TGCCAAAgca gGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG CCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TTGCCGGCGG CGGGGCCGTC TATACCGTTG TGCCGCACCT 

201 GTCCATGCCC CACTGGGCGG CGCaggATTT TGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

3 01 TGCGCCCAAG CCTTTCAAAC CCCCX5TGCAT TCCTTTCAGG CAAAGcGgTT 

351 TTTTGAACGC TATTTCACGC cgtGGCaggt tgcaggcaAC GGAAGcCTTG 

401 Caggtacggt TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGGCAGG 

451 CGGACGGAAC GGGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCG CTGCCTGCCG GTTTGCGGGG CGGAAATUU^C CTTGTCCGCA 

551 TCAGGCAGac ggGGAAAAAC AGCGGCACGA TCGACAATGC CGGCGGCACG 

601 CATACCGCCG ACCTCTCCCG ATTCCCCATC ACCGCGCGCA CAACGGcaat 

651 caaaGGCAGG TTTGAaggAA GCCGCTTCCT CCCTTACCAC ACGCGCAACC 

701 AAAtcaacGG CGGCgcgcTT GACGGCAAag cccCCATCCT CggttacgcC 

751 GAagaccCcG tcgaacttTT TTTCATGCAC AtccaaggCT CGGGCCGCCT 

801 GAAAACCCcg tccggcaaat acatCCGCAt cggaTacgcc gacAAAAACG 

851 AACAtccgTa tgtttccatc ggACGctaTA TGGCGGACAA AGGCTACCTC 

901 AAGctcgggc agACCTCGAT GCAGGgcatc aaagcCTATA TGCGGCAAAA 

951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC GGCAATGAGG GCCCCGTCGG CGCACTGGGC 

1051 ACGCCACTGA TGGGGGAATA CGCCGGCGCA ATCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CAGGCAGCGC GATCAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGTTAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATACGT CTGGCAGCTC CTGCCCAACG 

1301 GCATGAAGCC CGAATACCGC CCGTGA 
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This corresponds to the amino acid sequence <SEQ ID 3072; ORF 919,ng>: 

g919.pep 

1 MKKHLLRSAL YGIAAAILAA CQSRSIQTFP QPDTSVINGP DRPAGIPDPA 

51 GTTVAGGGAV YTWPHLSMP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SFQAKRFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDGR 

151 RTERARFPIY GIPDDFISVP LPAGLRGGKN LVRIRQTGKN SGTIDNAGGT 

201 HTADLSRFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

3 01 KLGQTSMQGI KAYMRQNPQR LAEVLGQNPS YIFFRELAGS GNEGPVGALG 

351 TPLMGEYAGA IDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 



ORF 919 shows 95.9 % identity over a 441 aa overlap v^th a predicted ORF (ORF 919.ng) 
from A'; gonorrhoeae: 

m919/g919 



m919 .pep 
g919 

m919 .pep 
g919 

m919 .pep 
g919 



10 20 30 40 50 60 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

I I I ^ I H H I I I I I I I I h I I I I I I I I I I I I I I : I I I I I I I I I I : I I I I I 

MKKHIiLRSALYGIAAAIIiAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 



10 



20 



30 



40 



50 



60 



70 80 90 100 110 120 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

IIIIIIIIHIIIIIIIIIIIIIIIII III Mil lllllMMi III mill ||:|||| 

YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 
70 80 90 100 110 120 

130 140 150 160 170 180 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

I I I I I I I II I I II I I I II i III : I I I I I I I I I I I I II I I I I I I I I : I I 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 

130 140 150 160 170 180 



190 200 210 220 230 240 

Tn91 9 . pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

II IMIMnhllllllllllllllllllllllllllll Mllllllll 

g 9 1 9 LVRI RQTGKNSGTI DNAGGTHTADLSRFP I TARTTAI KGRFEGSRFLP YHTRNQ INGGAL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m919 . pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKKTEHPYVSIGRYMADKGYL 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
g 9 1 9 DGKAPI LGYAEDPVELFFMHI QGSGRLKTPSGKYI RI GYADKNEHP YVS I GRYMADKGYL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m919 . pep KLGQTSMQGI KS YMRQNPQRIAEVLGQNPSYIFFREIAGSSNDGPVGALGTPLMGEYAGA 

III HI II IllllllllhhIII I 

g919 KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 

310 320 330 340 350 360 

370 380 390 400 410 420 

m919 . pep VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 

HIIIIIIIIIIIIIIMIIIIIIIIillilllllllMllliiiiiiiiiiiiiiiiii 

g919 IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDE^ 
370 380 390 400 



410 



420 
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430 440 
m919 .pep QKTTGYVWQLLPNGMKPEYRPX 

lllllllllillllllllllll 
g919 QKTTGYVWQLLPNGMKPEYRPX 
430 440 



The following partial DNA sequence was identified in Kmeningitidis <SEQ ID 3073>: 



a919 . seq 




1 


ATGAAAAAAT 


51 


CCTCGCCGCC 


101 


CATCCGTCAT 


151 


GGAACGACGG 


201 


GTCCCTGCCC 


251 


X \u>V.<V3\urVrf X V_ O VJ 


301 


TGCGCCCAAG 


351 


TTTTGAACGC 


401 


CCGGTACGGT 


451 


CGGACGGCAC 


501 


CTCCGTCCCC 


551 


TCAGGCAGAC 


601 


CATACCGCCG 


651 


CAAAGGCAGG 


701 


AAATCAACGG 


751 


GAAGACCCCG 


801 


GAAAACCCCG 


851 


AACATCCCTA 


901 


AAGCTCGGGC 


951 


CCCGC7VACGC 


1001 


TCCGAGAGCT 


1051 


ACGCGGCTGA 


1101 


CTTGGGCGCG 


1151 


CCCTCAACCG 


1201 


GCGGTGCGCG 


1251 


TGCCGGCAAA 


1301 


GTATG/VAGCC 



AGACCTCGAT GCAGGGCATC AAAGCCTATA TGCAGCAAAA 
CTCGCCGAAG TTTTGGGGCA AAACCCCAGC TATATCTTTT 
TACCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 
TGGGCGAGTA CGCCGGCGCA GTCGACCGGC ACTACATTAC 
CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 
CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 
TGGATTATTT TTGGGGATAC GGCGACG7VAG CCGGCGAACT 
CAGAAAACCA CGGGATATGT CTGGCAGCTT CTGCCCAACG 
CGAATACCGC CCGTAA 

This corresponds to the amino acid sequence <SEQ DD 3074; ORF 919.a>: 

a919.pep 

1 MKKYLFR7VAL CGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SVQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSQFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGQTSMQGI KAYMQQNPQR LAEVLGQNPS YIFFRELTGS SNDGPVGALG 

351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

4 01 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

m919/a919 ORFs 919 and 919.a showed a 98.6% identity in 441 aa overlap 

10 20 30 40 50 60 

m9l9 . pep MICKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

NMIIIIII 1 I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I M I I I I t 

a919 MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m9l9 . pep YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
llllllillllllllllllllllllljllllll Ill ,1,1 lliiiiii 

a919 YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
"^0 80 90 100 110 120 
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130 140 150 160 170 180 

m919 . pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

IIIIMI llllll lllllltlllllllllMllllllllilllllllllllllll 

a919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

130 140 150 160 170 IBO 

190 200 210 220 230 240 

m91 9 . pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

IIIIIIIMIIMMIIIIil Illlllll 

a919 LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m919 . pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

Illtllll IIIMllltMIIIIIIIIIIIIIM 

a919 DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m91 9 . pep KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

Illlllllllll: IIIIIIMIII 

8919 KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

310 320 330 340 350 360 

370 380 390 400 410 420 

m91 9 . pep VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
I I I M I f I I I i I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I j I I I I I I I I I I I I 
a 91 9 VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 

370 380 390 400 410 420 

430 440 
m9 1 9 . pep QKTTGYVWQLLPNGMKPE YRPX 

I I I I I I I I I I I I I I I t I I I I I I 
a 9 1 9 QKTTGYVWQLLPNGMKPEYRPX 
430 440 



121 and 121-1 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3075>; 

ml21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGxxxxxxx xxxxxxxxxx xxxxxxkxxx xxxxxxxxxx 

401 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

451 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

501 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

551 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

601 xxxxxxCAGC TTCCTTACGA CAAAAACGGT GCAAAGTCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACGCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCAT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGACG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGnATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 
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1051 GCAACCGGCG CATCCAAACC GTGTATTCTG AnCGCGGGAT ATTATTATTG 
1101 A 

This corresponds to the amino acid sequence <SEQ ID 3076; ORF 121>: 

ml21 .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3077>: 

gl21 . seq 

1 ATGGAAACAC 

51 GGCGGATGCC 

101 AAGGGCACGC 

151 GATTTGCAGG 

201 GCAAGAACTC 

251 GTCAAAACCT 

301 ACCGTCCGAC 

351 GCCGCTGCTG 

401 GCCGCGACCT 

4 51 CACGAAGCCC 

501 CGGCGGGATT 

551 GCTTCGACAC 

601 cacTGGcagc 

651 catatTGCcg 

701 AACCCcaccc 

751 gaaacctAcc 

801 ttcccgattc 

851 CAGATGCCCG 

901 TTAATGGCGG 

951 CACCGCCGAA 

1001 cgtggttggC 

1051 GCGACCGGCG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3078; ORF 121. ng>: 

gl21.pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRWLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (0RF121.ng) 
from A^. gonorrhoeae: 

ml21/gl21 

10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

I I I I M I M I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I : I I I 

gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

"^0 80 90 100 110 120 

na21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
11)1:111111 IIIIIIMM I I I I I I I I I I I I I I I I I M 



AGCTTTACAT CGGCATTATG 
GTGCTGGTAC GGATGGACGG 
CTTTACCCCC TACCCTGACC 
ACACAGGCAC AGACGAACTG 
AGCCGCCTGT ACGCGCAAAC 
CGCTCCGTGC GACATTACCG 
ACGCGCCGGA ACACGGTtac 
GCGGAACTGa cgcggatttT 
TGCTGCCGGC GGacaAGGTG 
TGTTCCGCGA TGACAGGGAA 
GCCAACATCA GCGTACTCCC 
AGGGCCGGGC AATATGCTGA 
TGCCTTACGA CAAAAacggt 
cAACTGCTCG gcaggctGCT 
aaAAAGCACG GGgcGCGaac 
ttgacggcgg cgaaaaccga 
accgcgcaaA ccgTttggga 
TCAAATGTAC ATTTGCGGCG 
ATTTGGCAGA ATGTTTCGGC 
CTGAACCTCG ATCCTCAATG 
GGCGTGTTGG ATTAACCGCA 
CATCCAAACC GTGTATTCTG 



TCGGGAACCA GTATGGACGG 
CGGCAAATGG CTGGGCGCGG 
GGTTGCGCCG CAAATTGCTG 
CACCGCAGCA GGATGTTGTC 
CGCCGCCGAA CTGCTGTGCA 
CCCTCGGCTG CCACGGGCAA 
AGCATACAGC TTGCCGATTT 
TACCGTCggc gacttcCGCA 
CGCCGCTCGT CCCCGCCTTT 
ACACGCGTGG TACTGAACAT 
CCCCGGCGCA CCCGCCTTCG 
TGGAcgcgtg gacgcaggca 
gcAAAGgcgg cacAAGGCAA 
CGCCcaccCG TATTTCTCAC 
TgtttgcccT AAattggctc 
tacgacgtat tgcggacgct 
cgccgtctca CACGCAGCGG 
GCGGCATCCG CAATCCTGTT 
ACACGCGTTT CCCTGCACAG 
GGTGGAGGCG gccgCATTtg 
TTCCCGGTAG TCCGCACAAA 
GGCGCGGGAT ATTATTATTG 



wo 99/57280 




PCTAJS99/09346 



gl21 

ml21 .pep 
gl21 

ml21 .pep 
gl21 

ml21 .pep 
gl21 

ml21.pep 
gl21 

ml21 .pep 
gl21 



HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 
'^^ 80 90 100 110 120 

130 140 150 160 170 180 

^ocxjaxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

AELTRIFTVGDFRSRDIAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 
130 140 150 160 170 180 

190 200 210 220 230 240 

XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 

' = Illhlllll 11111111:1 lllll 

PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
(IMMHillllllllllllllllllllllllll I I I I i I I I I I I I I I I I Mlllll 
GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 



• *• ' w £.0\J JUU 

310 320 330 340 350 360 

LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 

>[ IIIIMIIIIhllllllllMI III Illlllllllllllllllll 

LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRI PGS PHKATGASKPCIL 
310 320 330 340 350 360 



XAGYYYX 
I [ I I I I 
GAGYYYX 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 3079>: 



al21. seq 








1 


ATGGAAACAC 


AGCTTTACAT 


CGGCATCATG 


51 


GGCGGATGCC 


GTACTGATAC 


GGATGGACGG 


101 


AAGGGCACGC 


CTTTACCCCC 


TACCCCGGCA 


151 


GATTTGCAGG 


ACACAGGCGC 


GGACGAACTG 


201 


GCT^GAACTC 


AGCCGCCTGT 


ACGCGCA7\AC 


251 


GTCAAAACCT 


CGCGCCGTCC 


GACATTACCG 


301 


ACCGTCAGAC 


ACGCGCCGGA 


ACACAGTTAC 


351 


GCCGCTGCTG 


GCGGAACGGA 


CTCAGATTTT 


401 


GCCGCGACCT 


TGCGGCCGGC 


GGACAAGGCG 


451 


CACGAAGCCC 


TGTTCCGCGA 


CGACAGGGAA 


501 


CGGCGGGATT 


GCC7VACATCA 


GCGTACTCCC 


551 


GCTTCGACAC 


AGGACCGGGC 


AATATGCTGA 


601 


CACTGGCAGC 


TTCCTTACGA 


CAAAAACGGT 


651 


CATATTGCCG 


CAACTGCTCG 


ACAGGCTGCT 


701 


AACCCCACCC 


TAAAAGCACG 


GGGCGCGAAC 


751 


GAAACCTACC 


TTGACGGCGG 


CGAAAACCGA 


801 


TTCCCGATTC 


ACCGCGCAAA 


CCGTTTTCGA 


851 


CAGATGCCCG 


TCAAATGTAC 


ATTTGCGGCG 


901 


TTAATGGCGG 


ATTTGGCAGA 


ATGTTTCGGC 


951 


CACCGCCGAA 


CTGAACCTCG 


ATCCGCAATG 


1001 


CATGGATGGC 


GGCGTGTTGG 


GTCAACCGCA 


1051 


GCAACCGGCG 


CATCCAAACC 


GTGTATTCTG 


1101 


A 







This corresponds to the amino acid sequence <SEQ ID 3080; ORF 121. a>: 

al21.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 
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301 
351 



LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 
ATGASKPCIL GAGYYY* 



ml21/al21 ORFs 121 and 121. a showed a 74.0% identity in 366 aa overlap 

10 20 30 40 50 60 

ml 21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

IIDIitllllllllllllMt Illllllllllll : Illlil 

al21 ^5ETQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 

llll*IMII[lllllllll||||||[||||||||||||i 111:11:1111)111 

al21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
al21 AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
' MMIIII1I:||IIIIIIIIIIIIII[|||| Mill 

al21 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21 .pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 

11)111:11 Illlllillllll Illlllllltl II 

al21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21 . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 

■MMItllllllllMII:!! I III : I I II : I || M I I II I I II 

al21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21, pep XAGYYYX 
I I I I I I 

a 121 GAGYYYX 



identified inN. meningitidis <SEQ ID 308 1>: 



Further work revealed the DNA sequence 

ml21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC 

351 GCCGCTGCTG GCGG7VACGGA CGCGGATTTT TACCGTCGGC 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT 

451 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA 



GGATGGACGG 
CTGGGCGCGG 
CCAATTGCTG 
GGATTTTGTC 
CTGCTGTGCA 
CCACGGGCAA 
TTGCCGATTT 
GACTTCCGCA 
CCCCGCCTTT 
TACTGAACAT 
CCCGCCTTCG 
GACGCAGGCA 
CACAAGGCAA 
TATTTCGCAC 
AAATTGGCTC 
TGCGGACGCT 
CACGCAGCGG 
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851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 3082; ORF 121-1>: 

ml21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAETP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAAD/VRQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

ml21-l/gl21 ORFs 121-1 and 121. ng showed a 95.6% identity in 366 aa ove; 

10 20 30 40 50 60 

ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

M I I I I I I I I I I I I I : I I I I I I I I I I I M I I I I I I i I I I: I M I I I I I : I I I 

gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 



■70 80 90 100 110. 120 

ml21-l . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 

I I I I : I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l . pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGI ANISVLPPDA 

II I I I I I I II II i I I I II I I I I I I I I I i I I I : I I I I : I I I I I I i I 

gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l . pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

II t I I I I I i I I I I I I I I I I I I I I I I I I I I I I 

gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l . pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 

III! Milllllllllllllll IN IIMiiil Illlllll 

gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

na21-l . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 

I MIIIIII:ilMIIIII INI llllltlMI 

gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 



The following DNA sequence was identified in N, meningitidis <SEQ ED 308 3>: 

al21-l.seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 
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92 



51 


GGCGGATGCC 


101 


AAGGGCACGC 


151 


GATTTGCAGG 


201 


GCAAGAACTC 


251 


GTCAAAACCT 


301 


ACCGTCAGAC 


351 


GCCGCTGCTG 


401 


GCCGCGACCT 


451 


CACGAAGCCC 


501 


CGGCGGGATT 


551 


GCTTCGACAC 


601 


CACTGGCAGC 


651 


CATATTGCCG 


701 


AACCCCACCC 


751 


GAAACCTACC 


801 


TTCCCGATTC 


851 


CAGATGCCCG 


901 


TTAATGGCGG 


951 


CACCGCCGAA 


1001 


CATGGATGGC 


1051 


GCAACCGGCG 


1101 


A 



GTACTGATAC 
CTTTACCCCC 
ACACAGGCGC 
AGCCGCCTGT 
CGCGCCGTCC 
ACGCGCCGGA 
GCGGAACGGA 
TGCGGCCGGC 
TGTTCCGCGA 
GCCAACATCA 
AGGACCGGGC 
TTCCTTACGA 
CAACTGCTCG 
TAAAAGCACG 
TTGACGGCGG 
ACCGCGCAAA 
TCAAATGTAC 
ATTTGGCAGA 
CTGAACCTCG 
GGCGTGTTGG 
CATCCAAACC 



GGATGGACGG 
TACCCCGGCA 
GGACGAACTG 
ACGCGCAAAC 
GACATTACCG 
ACACAGTTAC 
CTCAGATTTT 
GGACAAGGCG 
CGACAGGGAA 
GCGTACTCCC 
AATATGCTGA 
CAAAAACGGT 
ACAGGCTGCT 
GGGCGCGT^C 
CGAAAACCGA 
CCGTTTTCGA 
ATTTGCGGCG 
ATGTTTCGGC 
ATCCGCAATG 
GTCAACCGCA 
GTGTATTCTG 



CGGCAAATGG 
GGTTACGCCG 
CACCGCAGCA 
CGCCGCCGAA 
CCCTCGGCTG 
AGCGTACAGC 
TACCGTCGGC 
CGCCGCTCGT 
ACACGCGCGG 
CCCCGACGCA 
TGGACGCGTG 
GCAAAGGCGG 
CGCCCACCCG 
TGTTTGCCCT 
TACGACGTAT 
CGCCGTCTCA 
GCGGCATCCG 
ACACGCGTTT 
GGTAGAAGCC 
TTCCCGGTAG 
GGCGCGGGAT 



CTGGGCGCGG 
CAAATTGCTG 
GGATGTTGTC 
CTGCTGTGCA 
CCACGGGCAA 
TTGCCGATTT 
GACTTCCGCA 
CCCCGCCTTT 
TACTGAACAT 
CCCGCCTTCG 
GATGCAGGCA 
CACAAGGCAA 
TATTTCGCAC 
AAATTGGCTC 
TGCGGACGCT 
CACGCAGCGG 
CAATCCTGTT 
CCCTGCACAG 
GCCGCGTTCG 
TCCGCACAAA 
ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 3084; ORF 121-l.a>: 

al21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHST7VE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21-l/al21-l ORFs 121-1 and 121-1. a showed a 96.4% identity in 366 aa overlap 



ml21-l.pep 



al21-l 



ml21-l .pep 



al21-l 



ml21-l . pep 



al21-l 



ml21-l .pep 



al21-l 



ml21-l.pep 



al21-l 



10 20 30 40 50 60 
METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
M I I I I M I I I I I I I I I I [ I I I I I I I I I I I I I [ I I I I : I I I I I I I I i ] I I 

METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 
10 20 30 40 50 60 

70 80 90 100 110 120 
HRSRILSQELSRLYAQTAMLLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
IMhIIIIIIIIII Mill 1111111111111111:11: ] 

HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 
70 80 90 100 110 120 

130 140 150 160 170 180 

AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 

I >> I: I I M I I I I I I I I I I t I I I I M I I I I I I : I I I I I I | | | | | j 

AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

mil iiiiiii iiiiiiiiiiiiiii iiiiiiiiiMiiiiiiiii 

PAFGFDTGPGNMLMDAWMQAHWQLPYDiCNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
IXIIIIIIIIII I Mill I IIIIIIIIIIIIIII 

GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 
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310 320 330 340 350 360 

ml21-l . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 

tlllllllllll 111:1111:111111111 MM 

al21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



128 and 128-1 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 3085>: 

ml28.seq (partial) 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTGCGTCX3C CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACT^GACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCAC 

1 TACGCCAGCG AAAAACTGCG CGAAGCCAAA TACGCGTTCA GCGAAACCGA 

51 wGTCAAAAAA TAyTTCCCyG TCGGCAAwGT ATTAAACGGA CTGTTCGCCC 

101 AAmTCAAAAA ACTmTACGGC ATCGGATTTA CCGAAAAAAC yGTCCCCGTC 

151 TGGCACAAAG ACGTGCGCTA TTkTGAATTG CAACAAAACG GCGAAinCCAT 

201 AGGCGGCGTT TATATGGATT TGTACGCACG CGAAGGCAAA CGCGGCGGCG 

251 CGTGGATGAA CGACTACAAA GGCCGCCGCC GTTTTTCAGA CGGCACGCTG 

301 CAAyTGCCCA CCGCCTACCT CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG 

351 CAGGGAAGCC CGCyTGAGCC ACGACGAAAT CCTCATCCTC TTCCACGAAA 

4 01 CCGGACACGG GCTGCACCAC CTGCTTACCC AAGTGGACGA ACTGGGCGTA 
451 TCCGGCATCA ACGGCGTAkA ATGGGACGCG GTCGAACTGC CCAGCCAGTT 
501 TATGGAAAAT TTCGTTTGGG AATACAATGT CTTGGCACAA mTGTCAGCCC 
551 ACGAAGAAAC CGGcgTTCCC yTGCCGAAAG AACTCTTsGA CAAAwTGCTC 
601 GCCGCCAAAA ACTTCCAAsG CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 
651 CGCCCTCTTT GATATGATGA TTTACAGCGA AGACGACGAA GGCCGTCTGA 
701 AAAACTGGCA ACAGGTTTTA GACAGCGTGC GCAAAA/U^GT CGCCGTCATC 
751 CAGCCGCCCG AATACAACCG CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 
801 AGGCGGCTAT TCCGCAGCTn ATTACAGCTA CGCGTGGGCG GAAGTATTGA 
851 GCGCGGACGC ATACGCCGCC TTTGAAGAAA GCGACGATGT CGCCGCCACA 
901 GGCAAACGCT TTTGGCAGGA AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 
951 nGCAGAATCC TTCAAAGCCT TCCGCGGCCG CGAACCGAGC ATAGACGCAC 

1001 TCTTGCGCCA CAGCGGTTTC GACAACGCGG TCTGA 

This corresponds to the amino acid sequence <SEQ ID 3086; ORF 128>: 

ml28 — 



II 



pep 


(partial) 




1 


MTDNALLHLG 


EEPRFDQIKT 


51 


NTVEPLTGIT 


ERVGRIWGW 


101 


GQDIELYNRF 


KTIKNSPEFD 


1 


YASEKLREAK 


YAFSETXVKK 


51 


WHKDVRYXEL 


QQNGEXIGGV 


101 


QLPTAYLVCN 


FAPPVGGREA 


151 


SGINGVXWDA 


VELPSQFMEN 


201 


AAKNFQXGMF 


XVRQXEFALF 


251 


QPPEYNRFAL 


SFGHIFAGGY 


301 


GKRFWQEILA 


VGXSRSGAES 
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The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 3087>: 

gl28 . seq 

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCX3TCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCX3CACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGccaaa CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

1401 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

1451 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 

1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 

1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 

1701 GAAAT^CTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 

1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatCtTC 

1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 

1851 CAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 

1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC 

1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 3088; ORF 128.ng>: 

gl28 -pep 

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA lAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHUISWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 EIAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGIOIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGEPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
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651 AAESFKAFRG REPSIDALLR QSGFDNAA* 



ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N, gonorrhoeae: 

ml28/gl28 

10 20 30 40 50 60 

gl28.pep MIDNALLHLGEEPRFNQIQTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 

I Illllhll: Ihllllll hllllllll 

tnl28 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 

70 80 90 100 110 120 

gl28.pep ERVGRIWGWSHLNSWDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

mil II mill I I--IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 
IT>128 ERVGRIWGWSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 140 150 160 170 180 

gl28 . pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

lllllllllhl 
ml 2 8 TLS PAQKTKLNH 

130 

// 

340 350 360 

gl28 .pep VAGEKLREAKYAFSETEVKKYPPVGKVLAG 

Ihlllllllllllll llllllll II I 

YASEKLREAKYAFSETXVKKYFPVGXVLNG 
10 20 30 



ml28 



370 380 390 400 410 420 

gl28 . pep LFAQIKKLYGIGFAEKTVPWHKDTOYFELQQNGKTIGGVYMDLYAREGKRGGAWMNDYK 

Illlllhllllllll mill:: Illllllllllllllllll 

tnl28 I'FAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWMNDYK 
40 50 60 70 80 90 

430 440 450 460 470 480 

gl28 . pep C3RRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVDELGV 

I II II HI mm II II 1 1 1 hi III nil 1 1 miiiiiiiiiiiiiiiiii 

GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEI LI LFHETGHGLHHLLTQVDELGV 
100 110 120 130 140 150 



Tnl28 



490 500 510 520 530 540 

gl2 8 . pep SGINGVEWDAVELPSQFMENFVWEYNVl^QMSAHEETGEPLPKELFDKMLAAKNFQRGMF 

mm imiiiiiiiiiiiiiiiim iiiiiii mm ii imm m 

SGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQXGMF 
ISO 170 180 190 200 210 



ml28 



550 560 570 580 590 600 

gl28 . pep liVRQMEFALFDMMIYSESDECRLKNWQQVLDSVRKEVAVIQPPEYNRFANSFGHIFAGGY 

iiiiiiiiihii iiiiiiiiiiiiii-.iiiiiiiiimi iimiiiii 

XVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIFAGGY 
220 230 240 250 260 270 



ml28 



610 620 630 640 650 660 

gl28 . pep SAGYYSYAWAEVLSTDAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRGREPS 

11^ iimiiiih iimiiiiimiiiiiiiii iih iim 

SAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRGREPS 
280 290 300 310 320 330 



ml28 
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gl2 8 . pep IDALLRQSGFDNAAX 

lllllhlllllh 
ml 2 8 IDALLRHSGFDNAVX 
340 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 3089>: 

al28. seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 PJkTChPiPJiCC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAAC/VAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGMCTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 3090; ORF 128.a>: 

al28.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA lAEAREQlAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFiCNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNETPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28/al28 ORFs 128 and 128.a showed a 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml 2 8. pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

l>f>lllll INIIIIIIIIIIIIIlillliiiiiMIIIMIlljllilliiii 

MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 



al28 



''O 80 90 100 110 120 

inl28.pep ERVGRIWGWSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

IXIIIIIIIIIII 1:1111111:1111 Illlllllllllillllllllilill 

al28 ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

"70 80 90 100 110 120 

130 

ml28.pep TLSPAQKTKLNH 

III I I II I I I I 

al28 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 ISO 



ml28 .pep 
al28 



FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 
190 200 210 220 230 240 



ml28 .pep 
al28 



TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 



ml28 .pep 
al28 

ml28.pep 
al28 

ml28.pep 
al28 

ml28 .pep 
al28 

inl28.pep 
al28 



140 150 
YASEKLREAKYAFSETXVKKYFPVGX 
11:1111111111111 I I II I I 11 
ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

160 170 180 190 200 210 

VLNGLFAQXKKLYGIGETEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 
llllllll MlllllflllllllMIIIII lllllll:illlll|||||||||||||| 
VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 

220 230 240 250 260 270 

NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

>>l<llll IIIIIIMI:|||||:|||||||||| IIIIIIIIIIIMIIII 

NDYKGRRRFSDGTLQLPTAYLVCNBTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
430 440 450 460 470 480 

280 290 300 310 320 330 

ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
) ■ > I I > I I I t I I I I I I I I I I I I II M I I I I M I 11 II t I I I I II II I II I I I II II 
ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

340 350 360 370 380 390 

XGMFXVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 
■ I > III 11 ) I I I I I I II I I I I I I I I I I I I I I : I I I :: II I I I I I I | j j | | | 

RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 
550 560 570 580 590 600 
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400 410 420 430 440 450 

ml28 . pep AGGYSAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRG 
>>>>II* tlltlllllltlllllllllflllltllllllllllll ll|:|||j|IIM 
AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
610 620 630 640 650 660 



al28 



460 470 
ml28 .pep REPSIDALLRHSGFDNAVX 

I i I I I I I I I I I I I I I I I : 
al28 REPSIDALLRHSGFDNAAX 
670 



Further work revealed the DNA sequence identified in A^. meningitidis <SEQ ID 3091>: 

ml28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA T^AAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG C/ACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC 

651 ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAACTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGCAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAAGTTT TT^CTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTGA ACCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCAG CGAAAAACTG CGCGAAGCCA AATACGCGTT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 

1351 GGCAGGGAAG CCCGCCTGAG CCACGACGAA ATCCTCATCC TCTTCCACGA 

1401 AACCGGACAC GGGCTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGCATGT TCCTCGTCCG GCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAA/^ GTCGCCGTCA 

1751 TCCAGCCGCC CGAATACAAC CGCTTCGCCT TGAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 3092; ORF 128-1 >: 

ml28-l,pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA lAETVREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVADTP ELRAVYNELM PEITVFFTEI 
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101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDEVLS GAELPPECX3A 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLANALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DIAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 

451 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAV* 

The following DNA sequence was identified in A^. gonorrhoeae <SEQ ID 3093>: 

gl28-l.seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 . TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TGTCCGGCAT CAACGGCGTA AAA 

This corresponds to the amino acid sequence <SEQ ID 3094; ORF 128-l.ng>; 

gl28-l.pep (partial) 

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA lAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 



ml2B-l/gl28-l ORFs 128-1 and 128-1. ng showed a 94.5% identity in 491 
overlap 
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gl28-l . pep MIDNALLHLGEEPRFNQIKTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 
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iMIMMII:lllllllllllllill:llllillltllj|||||||||||||||||||| 
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490 

gl28-l.pep ELGVSGINGVK 
I I I I I I I I I I : 

ml28-l ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3095>: 

al28-l.seq 

1 ATGACTGACA 

51 AATCAAAACC 

101 CGCGCGAACA 

151 AACACTGTCG 

201 GGGCGTGGTG 

251 CCGCCTACAA 

301 GGACAAGACA 

351 CGAGTTCGAC 

401 TGCGCGATTT 

4 51 GAATTGGCAA 



ACGCACTGCT CCATTTGGGC 
GAAGACATCA AACCCGCCCT 
AATCGCCGCC ATCAAAGCCC 
AACCCCTGAC CGGCATCACC 
TCGCACCTCA ACTCCGTCAC 
TGAATTAATG CCCGAAATTA 
TCGAGCTGTA CAACCGCTTC 
ACCCTCTCCC ACGCGCAAAA 
CGTCCTCAGC GGCGCGGAAC 
AACTGCAAAC CGAAGGCGCG 



GAAGAACCCC GTTTTGATCA 
GCAAACCGCC ATTGCCGAAG 
AAACGCACAC CGGCTGGGCA 
GAACGCGTCG GCAGGATTTG 
CGACACGCCC GAACTGCGCG 
CCGTCTTCTT CACCGAAATC 
AAAACCATCA AAAACTCCCC 
AACCAAACTC AACCACGATC 
TGCCGCCCGA ACAGCAGGCA 
CAACTTTCCG CCAAATTCTC 
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501 CCAAAACGTC CTAGACGCGA 

551 CCGCACCGCT TGCCGGCATT 

601 GCCGCGCAAA GCGAAGGCAA 

651 GCACTACCTC GCCGTCATCC 

701 AAATCTACCG CGCCTACGTT 

751 AAATTCGACA ACACCGCCAA 

801 AACCGCCAAA CTGCTCGGCT 

851 CCAAAATGGC GGACACCCCC 

901 GCCCGCCGCG CCAAACCCTA 

951 CTTCGCCCGC GAAAGCCTCG 

1001 GCTACGCCGG CGAAAAACTG 

1051 GAAGTCAAAA AATACTTCCC 

1101 CCAAATCAAA AAACTCTACG 

1151 TCTGGCACAA AGACGTGCGC 

1201 ATAGGCGGCG TTTATATGGA 

1251 CGCGTGGATG AACGACTACA 

1301 TGCAACTGCC CACCGCCTAC 

1351 GGCAAAGAAG CCCGCTTGAG 

1401 AACCGGACAC GGCCTGCACC 

14 51 TATCCGGCAT CAACGGCGTA 

1501 TTTATGGAAA ATTTCGTTTG 

1551 CCACGAAGAA ACCGGCGTTC 

1601 TCGCCGCCAA AAACTTCCAA 

1651 TTCGCCCTCT TTGATATGAT 

1701 GAAAAACTGG CAACAGGTTT 

1751 TCCGACCGCC CGAATACAAC 

1801 GCAGGCGGCT ATTCCGCAGG 

1851 GAGCGCGGAC GCATACGCCG 

1901 CAGGCAT^CG CTTTTGGCAG 

1951 GCGGCAGAAT CCTTCAAAGC 

2001 ACTCTTGCGC CACAGCGGCT 



CCGACGCGTT CGGCATTTAC TTTGACGATG 
CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 
MCAGGCTAC AAAATCGGTT TGCAGATTCC 
AATACGCCGA CAACCGCAAA CTGCGCGAAC 
ACCCGCGCCA GCGAGCTTTC AGACGACGGC 
CATCGACCGC ACGCTCGAAA ACGCCCTGCA 
TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
GAACAAGTTT TAAACTTCCT GCACGACCTC 
CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
GCCTCGCCGA TTTGCAACCG TGGGACTTGG 
CGCGAAGCCA AATACGCATT CAGCGAAACC 
CGTCGGCAAA GTATTAAACG GACTGTTCGC 
GCATCGGATT TACCGAAAAA ACCGTCCCCG 
TATTTTGAAT TGCAACAAAA CGGCGAAACC 
TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

ctcgtctgca acttcacccc gcccgtcggc 
ccatgacgaa atcctcaccc tcttccacga 
acctgcttac ccaagtcgac gaactgggcg 

GAATGGGACG CAGTCGAACT GCCCAGTCAG 
GGAATACAAT GTCTTGGCGC AAATGTCCGC 
CCCTGCCGAA AGAACTCTTC GACAAAATGC 
CGCGGAATGT TCCTCGTCCG CCAAATGGAG 
GATTTACAGC GAAGACGACG AAGGCCGTCT 
TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 
CGCTTCGCCA ACAGCTTCGG CCACATCTTC 
CTATTACAGC TACGCGTGGG CGGAAGTATT 
CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
CTTCCGCGGA CGCGAACCGA GCATAGACGC 
TCGACAACGC GGCTTGA 



This corresponds to the amino acid sequence <SEQ ID 3096; ORF 128-1. a>: 

al28-l.pep 

1 MTDKALLHLG EEPRFDQIKT EDIKPALQTA lAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAA* 



inl28-l/al28-l ORFs 128-1 and 128-l.a showed a 97.8% identity in 677 aa overlap 
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ral28-l 
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inl28-l 
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ml28-l 

al28-l,pep 
ml28-l 
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inl28-l 
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ral28-l 
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206 



The following partial DNA sequence was identified in N. meningitidis <SEQ DD 3097>: 

m206.seq 
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1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAr AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGsAAA ATCCCCGAcA GCCGCyTCAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TTTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3098; ORF 206>: 

m206 .pep. . 

1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 3099>: 

g206.seq 

1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggactca tcggcacgcc 

201 ctacaaatgg ggcggcagca gcaccgcaac cggottcgac tgcagcggca 

251 tgattcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

301 gcccgcgaca tggcggcggc aagccgcaaa atccccgaca gccgcctcaa 

351 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctactcac 

401 acgtcggact ctacatcggc aacggcgaat tcatccatgc ccccggcagc 

451 ggcaaaacca tcaaaaccga aaaactctcc acaccgtttt acgccaaaaa 

501 ctaccttgga gcgcatacgt tttttacaga atga 
This corresponds to the amino acid sequence <SEQ ID 3100; ORF 206.ng>: 

g206 .pep 

1 MFSPDKTLFL CLGALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIGRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 

101 ARDMAAASRK I PDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 



ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from A^. gonorrhoeae: 

Tn206/g206 
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130 140 150 160 170 

The following partial DNA sequence was identified in A^. meningitidis <SEQ E) 3101>: 

a206.seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA 6CCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TCAAAACCGA . AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3102; ORF 206.a>: 

a206 .pep 

1 MFPPDK TLFL CLSALLLASC GTT SGKHROP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 ORFs 206 and 206.a showed a 99.4% identity in 1 77 aa overlap 
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a206 LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 

70 80 90 100 110 120 

130 140 150 160 170 

in206 . pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
I I Mt M M I M M I M M M M M M M M M t M M M M M I I I M M M M M I 
a206 LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

130 140 150 160 170 



287 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3103>: 

in287 .seq 

1 ATGTTTAAAC GCAGCGTAAT CGCAATGGCT TGTATTTTTG CCCTTTCAGC 

51 CTGCGGGGGC GGCGGTGGCG GATCGCCCGA TGTCAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTTCTG AAAAAGAGAC AGAGGCAAAG 

151 GAAGATGCGC CACAGGCAGG TTCTCAAGGA CAGGGCGCGC CATCCGCACA 

201 AGGCAGTCAA GATATGGCGG CGGTTTCGGA AGAAAATACA GGCAATGGCG 

251 GTGCGGTAAC AGCGGATAAT CCCAAAAATG AAGACGAGGT GGCACAAAAT 

301 GATATGCCGC AAAATGCCGC CGGTACAGAT AGTTCGACAC CGAATCACAC 

351 CCCGGATCCG AATATGCTTG CCGGAAATAT GGAAAATCAA GCAACGGATG 

4 01 CCGGGGAATC GTCTCAGCCG GCAAACCAAC CGGATATGGC AAATGCGGCG 

451 GACGGAATGC AGGGGGACGA TCCGTCGGCA GGCGGGCAAA ATGCCGGCAA 

501 TACGGCTGCC CAAGGTGCAA ATCAAGCCGG AAACAATCAA GCCGCCGGTT 

551 CTTCAGATCC CATCCCCGCG TCAAACCCTG CACCTGCGAA TGGCGGTAGC 
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601 AATTTTGGAA GGGTTGATTT GGCTAATGGC GTTTTGATTG ACGGGCCGTC 

651 GCAAAATATA ACGTTGACCC ACTGTAAAGG CGATTCTTGT AGTGGCAATA 

701 ATTTCTTGGA TGAAGAAGTA CAGCTAAAAT CAGAATTTGA AAAATTAAGT 

751 GATGCAGACA AAATAAGTAA TTACAAGAAA GATGGGAAGA ATGATAAATT 

801 TGTCGGTTTG GTTGCCGATA GTGTGCAGAT GAAGGGAATC AATCAATATA 

851 TTATCTTTTA TAAACCTAAA CCCACTTCAT TTGCGCGATT TAGGCGTTCT 

901 GCACGGTCGA GGCGGTCGCT TCCGGCCGAG ATGCCGCTGA TTCCCGTCAA 

951 TCAGGCGGAT ACGCTGATTG TCGATGGGGA AGCGGTCAGC CTGACGGGGC 

1001 ATTCCGGCAA TATCTTCGCG CCCGAAGGGA ATTACCGGTA TCTGACTTAC 

1051 GGGGCGGAAA AATTGCCCGG CGGATCGTAT GCCCTTCGTG TTC7VAGGCGA 

1101 ACCGGCAAAA GGCGAAATGC TTGCGGGCGC GGCCGTGTAC AACGGCGAAG 

1151 TACTGCATTT CCATACGGAA AACGGCCGTC CGTACCCGAC CAGGGGCAGG 

1201 TTTGCCGCAA AAGTCGATTT CGGCAGCAAA TCTGTGGACG GCATTATCGA 

1251 CAGCGGCGAT GATTTGCATA TGGGTACGCA AAAATTCAAA GCCGCCATCG 

1301 ATGGAAACGG CTTTAAGGGG ACTTGGACGG AAAATGGCAG CGGGGATGTT 

1351 TCCGGAAAGT TTTACGGCCC GGCCGGCGAG GAAGTGGCGG GAAAATACAG 

1401 CTATCGCCCG ACAGATGCGG AAAAGGGCGG ATTCGGCGTG TTTGCCGGCA 

14 51 AAAAAGAGCA GGATTGA 

This corresponds to the amino acid sequence <SEQ ID 3104; ORF 287>: 

m287 .pep 

1 MFKRSVIAMA CIFALSA CGG GGGGSPDVKS ADTLSKPAAP WSEKETEAK 

51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 

101 DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA 

151 DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 

201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 

251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS 

301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 

351 GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR 

4 01 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK T^IDGNGFKG TWTENGSGDV 

451 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 

The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 3105>: 

g287 . seq 

1 atgtttaaac gcagtgtgat tgcaatggct tgtatttttc ccctttcagc 

51 ctgtgggggc ggcggtggcg gatcgcccga tgtcaagtcg gcggacacgc 

101 cgtcaaaacc ggccgccccc gttgttgctg aaaatgccgg ggaaggggtg 

151 ctgccgaaag aaaagaaaga tgaggaggca gcgggcggtg cgccgcaagc 

201 cgatacgcag gacgcaaccg ccggagaagg cagccaagat atggcggcag 

251 tttcggcaga aaatacaggc aatggcggtg cggcaacaac ggacaacccc 

301 aaaaatgaag acgcgggggc gcaaaatgat atgccgcaaa atgccgccga 

351 atccgcaaat caaacaggga acaaccaacc cgccggttct tcagattccg 

401 cccccgcgtc aaaccctgcc cctgcgaatg gcggtagcga ttttggaagg 

451 acgaacgtgg gcaattctgt tgtgattgac ggaccgtcgc aaaatataac 

501 gttgacccac tgtaaaggcg attcttgtaa tggtgataat ttattggatg 

551 aagaagcacc gtcaaaatca gaatttgaaa aattaagtga tgaagaaaaa 

601 attaagcgat ataaaaaaga cgagcaacgg gagaattttg tcggtttggt 

651 tgctgacagg gtaaaaaagg atggaactaa caaatatatc atcttctata 

701 cggacaaacc acctactcgt tctgcacggt cgaggaggtc gcttccggcc 

751 gagattccgc tgattcccgt caatcaggcc gatacgctga ttgtggatgg 

801 ggaagcggtc agcctgacgg ggcattccgg caatatcttc gcgcccgaag 

851 ggaattaccg gtatctgact tacggggcgg aaaaattgcc cggcggatcg 

901 tatgccctcc gtgtgcaagg cgaaccggca aaaggcgaaa tgcttgttgg 

951 cacggccgtg tacaacggcg aagtgctgca tttccatatg gaaaacggcc 

1001 gtccgtaccc gtccggaggc aggtttgccg caaaagtcga tttcggcagc 

1051 aaatctgtgg acggcattat cgacagcggc gatgatttgc atatgggtac 

1101 gcaaaaattc aaagccgcca tcgatggaaa cggctttaag gggacttgga 

1151 cggaaaatgg cggcggggat gtttccggaa ggttttacgg cccggccggc 

1201 gaggaagtgg cgggaaaata cagctatcgc ccgacagatg ctgaaaaggg 

1251 cggattcggc gtgtttgccg gcaaaaaaga tcgggattga 

This corresponds to the amino acid sequence <SEQ ID 3106; ORF 287.ng>: 

g287 .pep 

1 MFKRSVIAMA CIFPLSA CGG GGGGSPDVKS ADTPSKPAAP WAENAGEGV 
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51 LPKEKKDEEA AGGAPQADTQ DATAGEGSQD MAAVSAENTG NGGAATTDNP 

101 KNEDAGAQND MPQNAAESAN QTGNNQPAGS SDSAPASNPA PANGGSDFGR 

151 TNVGNSWID GPSQNITLTH CKGDSCNGDN LLDEEAPSKS EFEKLSDEEK 

201 IKRYKKDEQR ENFVGLVADR VKKDGTNKYI IFYTDKPPTR SARSRRSLPA 

251 EIPLIPVNQA DTLIVDGEAV SLTGHSGNIF APEGNYRYLT YGAEKLPGGS 

301 YALRVQGEPA KGEMLVGTAV YNGEVLHFHM ENGRPYPSGG RFAAKVDFGS 

351 KSVDGIIDSG DDLHMGTQKF KAAIDGNGFK GTWTENGGGD VSGRFYGPAG 

401 EEVAGKYSYR PTDAEKGGFG VFAGKKDRD* 



m287/g287 ORFs 287 and 287.ng showed a 70.1% identity in 499 aa overlap 



in287 .pep 
g287 

in287.pep 
g287 

m287 .pep 
g2B7 



10 20 30 40 4 9 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

M I I I 1 I M It I I I I t I ] i I I I [Ill I M I I I I I : t I : || 

MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPWAENAGEGVLPKEKKDEEA 

10 20 30 40 50 60 

50 60 70 80 90 100 109 

KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 

WW '\ I :::lllllllll llllllll:|:||lllll llllllllll 
AGGAPQADTQD— ATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAA— 
70 80 90 100 110 

110 120 130 140 150 160 169 

DS ST PNHT PDPNMLAGNMENQATDAGE S SQPANQPDMANAADGMQGDD PSAGGQNAGNTA 



170 180 190 200 210 220 229 

m287.pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 

••111:1111 lllll lllllll}||||:|||::::|:|:||j |||i|| 

g287 -ESANQTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDS 
120 130 140 150 160 170 

230 240 250 260 270 280 289 

m287 . pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

l-l'I'llll- illlMlil :M: 111! : ::|||N|| |: | j: 

g2 8 7 CNGDNLLDEEAPSKSE FEKLS DEEKI KRYKKDEQRENFVGLVADRVKKDGTNKYI I FYT D 

180 190 200 210 220 230 

290 300 310 320 330 340 349 

m287.pep KPTSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 

II ' I WWW Illllllllllllllllllllllllllll 

g287 KPPT RSARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 

240 250 260 270 280 290 

350 360 370 380 390 400 409 

m287 . pep YGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAICVDFGS 
I I M II I II II I I I I I I I I I I I I I I : I : I II I II I II II I I II I II : I II I II II I I t 
g287 YGAEKLPGGSYALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGS 
300 310 320 330 340 350 

410 420 430 440 450 460 469 

m287 . pep KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYR 
I I M M I II I I II I I I II t I II I I I I II I I I I I II II: I II II: li I I I I I I 1 I II II II 
g287 KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYR 
360 370 380 390 400 410 



m2 8 7 . pep 



470 480 489 

PT DAEKGG FG V FAGKKE Q DX 
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I I I I I I I I I I I I I I I I :: I I 
g287 PTDAEKGGFGVFAGKKDRDX 
420 430 



identified in K meningitidis <SEQ ID 3107>: 



The following partial DNA sequence was 

a287.seq 

1 ATGTTTAAAC GCAGTGTGAT TGCAATGGCT TGTATTGTTG 

51 CTGTGGGGGC GGCGGTGGCG GATCGCCCGA TGTTAAGTCG 

101 TGTCAAAACC TGCCGCCCCT GTTGTTACTG AAGATGTCGG 

151 CTGCCGAAAG AAAAGAAAGA TGAGGAGGCG GTGAGTGGTG 

201 CGATACGCAG GACGCAACCG CCGGAAAAGG CGGTCAAGAT 

251 TTTCGGCAGA AAATACAGGC AATGGCGGTG CGGCAACAAC 

301 GAAAATAAAG ACGAGGGACC GCAAAATGAT ATGCCGCAAA 

351 TACAGATAGT TCGACACCGA ATCACACCCC TGCACCGAAT 

401 GAGATATGGG AAACCAAGCA .CCGGATGCCG GGGAATCGGC 

451 AACCAACCGG ATATGGCAAA TGCGGCGGAC GGAATGCAGG 

501 GTCGGCAGGG GAAAATGCCG GCAATACGGC AGATCAAGCT 

551 CTGAAAACAA TCAAGTCGGC GGCTCTCAAA ATCCTGCCTC 

601 CCTAACGCCA CGAATGGCGG CAGCGATTTT GGAAGGATAA 

651 TGGCATCAAG CTTGACAGCG GTTCGG7VAAA TGTAACGTTG 

701 7VAGACAAAGT ATGCGATAGA GATTTCTTAG ATGAAGAAGC 

751 TCAGAATTTG AAAAATTAAG TGATGAAGAA AAAATTAATA 

801 AGACGAGCAA CGAGAGAATT TTGTCGGTTT GGTTGCTGAC 

851 AGAATGGAAC TAACAAATAT GTCATCATTT ATAAAGACAA 

901 TCTTCATCTG CGCGATTCAG GCGTTCTGCA CGGTCGAGGC 

951 GGCCGAGATG CCGCTGATTC CCGTCAATCA GGCGGATACG 

1001 ATGGGGAAGC GGTCAGCCTG ACGGGGCATT CCGGCAATAT 

1051 GAAGGGAATT ACCGGTATCT GACTTACGGG GCGGAAAAAT 

1101 ATCGTATGCC CTCAGTGTGC AAGGCGAACC GGCAAAAGGC 

1151 CGGGCACGGC CGTGTACAAC GGCGAAGTGC TGCATTTCCA 

1201 GGCCGTCCGT CCCCGTCCGG AGGCAGGTTT GCCGCAAAAG 

1251 CAGCAAATCT GTGGACGGCA TTATCGACAG CGGCGATGAT 

1301 GTACGCAAAA ATTCAAAGCC GTTATCGATG GAAACGGCTT 

1351 TGGACGGAAA ATGGCGGCGG GGATGTTTCC GGAAGGTTTT 

1401 CGGCGAAGAA GTGGCGGGAA AATACAGCTA TCGCCCGACA 

1451 AGGGCGGATT CGGCGTGTTT GCCGGCAAAA AAGAGCAGGA 



CCCTTTCAGC 
GCGGACACGC 
GGAAGAGGTG 
CGCCGCAAGC 
ATGGCGGCAG 
GGATAATCCC 
ATGCCGCCGA 
ATGCCAACCA 
ACAACCGGCA 
GGGACGATCC 
GCAAATCAAG 
TTCAACCAAT 
ATGTAGCTAA 
ACACATTGTA 
ACCACCAAAA 
AATATAAAAA 
AGGGTAGAAA 
GTCCGCTTCA 
GGTCGCTTCC 
CTGATTGTCG 
CTTCGCGCCG 
TGTCCGGCGG 
GAAATGCTTG 
TATGGAAAAC 
TCGATTTCGG 
TTGCATATGG 
TAAGGGGACT 
ACGGCCCGGC 
GATGCGG7\AA 
TTGA 



This corresponds to the amino acid sequence <SEQ ID 3108; ORF 287.a>: 

a287.pep 

MFKRSVIAMA CIVALSA CGG GGGGSPDVKS ADTLSKPAAP WTEDVGEEV 
LPKEKKDEEA VSGAPQADTQ DATAGKGGQD MAAVSAENTG NGGAATTDNP 
ENKDEGPQND MPQNAADTDS STPNHTPAPN MPTRDMGNQA PDAGESAQPA 
NQPDMANAAD GMQGDDPSAG ENAGNTADQA ANQMNNQVG GSQNPASSTN 
PNATNGGSDF GRINVANGIK LDSGSENVTL THCKDKVCDR DFLDEEAPPK 
SEFEKLSDEE KINKYKKDEQ RENFVGLVAD RVEKNGTNKY VIIYKDKSAS 
SSSARFRRSA RSRRSLPAEM PLIPVNQADT LIVDGEAVSL TGHSGNIFAP 
EGNYRYLTYG AEKLSGGSYA LSVQGEPAKG EMLAGTAVYN GEVLHFHMEN 
GRPSPSGGRF AAKVDFGSKS VDGIIDSGDD LHMGTQKFKA VIDGNGFKGT 
WTENGGGDVS GRFYGPAGEE VAGKYSYRPT DAEKGGFGVF AGKKEQD* 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 



m2B7/a287 

m287.pep 
a287 

m287 .pep 
a287 



ORFs 287 and 287. a showed a 77.2% identity in 501 aa overlap 

10 20 30 40 49 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

XlfHIMIII lllllllllllllllllll IlilMllll:! |: ,, 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPWTEDVGEEVLPKEKKDEEA 
10 20 30 40 50 60 

50 60 70 80 90 100 109 

KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 
) ) I I : I i : : : I : I I I I I I I I I I I I I I I : I : I I I : j : I I [ | | | | | | | | | 

VSGAPQADTQ— DATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADT 
''O 80 90 100 110 
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110 120 130 140 150 160 169 

m287 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 
IMIIIIII Ml : :| ||| I I I I I : I I I M I I I I M I I I I I I I I I I I I :)||||| 
a287 DSSTPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAG-ENAGNTA 
120 130 140 150 160 170 

170 180 190 200 210 220 229 

m287 . pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 
f:HII ::|| : | | | | : | t | : : : l | | : :|: |:|:|,,,,, 

a287 DQAANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKV 
180 190 200 210 220 230 

230 240 250 260 270 280 289 

m287 . pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

I- Mllll: IIMIUII : ::||||MI |: :| |:|:|:|| 

a287 CD-RDFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVIIYKD 
240 250 260 270 280 290 

290 300 310 320 330 340 

m287,pep KP—TSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 
t ' ) N I I I I I I I I I I I I I I I I M I I I I I i I I I I I I I I I I I I I I I [ I I M I I I I I I I 
a287 KSASSSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 
300 310 320 330 340 350 

350 360 370 380 390 400 

in287 . pep LTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDF 

I I I I I I M I I I I M I I f i I I I I I I I t I : I I I I i I I I I I I Mill I : I M II II M 
a287 LTYGAEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDF 
360 370 380 390 400 410 

410 420 430 440 450 460 

m287 . pep GSKSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYS 

MIIMMMIilMIIMMMIMMIMIMMIMMIIiMMMMIMMIII 
a2 8 7 GSKSVDGI I DSGDDLHMGTQKFKAVI DGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYS 

420 430 440 450 460 470 

470 480 489 

iii287 . pep YRPTDAEKGGFGVFAGKPCEQDX 

I II II M M II I I M II M M I 
a287 YRPTDTVEKGGFGVFAGKKEQDX 
480 490 



406 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3109>: 

in406.seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TTGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 CACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

301 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CTCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 
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701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTT^GCAA 

801 AGGAATTAAA CCGACGGAAG GATTAATGGT CGATTTCTCC GATATCCGAC 

851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GTAGTGCGAC AACATAGACA 

951 AGGACAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 31 10; ORF 406>: 

m406 .pep 

1 MQARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FliSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIRPYGNHTG NSAPSVEADN 

3 01 SHEGYGYSDE WRQHRQGQP * 

The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 31 1 1>: 

g406.8eq 

1 ATGCGGGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGCAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGCGGC GAATACATAA ACAGCCCTGC CGTCCGCACC 

301 GATTACACCT ATCCGCGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACGGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CGCGCACCCA ATCAGACGGT AGCGGAAGTA GGAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CCAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTGCA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 

701 GAACCAATAA AAAATTGCTC ATCAAACCCA AAACCAATGC GTTTGAAGCT 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 

801 AGGAATCAAA CCGACGGAAG GATTGATGGT CGATTTCTCC GATATCCAAC 

851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC AACATAGACA 

951 AGGGCAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 31 12; ORF 406>: 

g406.pep 

1 MRARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSRSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHTG NSAPSVEADN 

301 SHEGYGYSDE AVRQHRQGQP ♦ 



ORF 406 shows 98.8% identity over a 320 aa overlap with a predicted ORF (ORF406.a) from 
A^. gonorrhoeae: 

g406/m406 

10 20 30 40 50 60 

g406.pep MRARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 

I^INIIIMIIIIIIIIIIIIIIIIIIIMIIIilllllMMiiiiiiiiiiiiiiii 

in4 0 6 MQARLLI PI LFSVFI LS ACGTLTGI PSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 
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10 20 30 40 50 60 

70 80 90 100 110 120 

g406 . pep KVALYIATMGDOGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

MIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIlllMiiMMill 

m406 KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 
70 80 90 100 110 120 

130 140 150 160 170 180 

g4 0 6 . pep LTTSLSTLNAPALSRTQSDGSGSRSSLGLNI GGMGDYRNETLTTNPRDTAFLSHLVQTVF 

MMIIIMIMIIIIIIIIIIhllllllllMIIMIIIIIIMIIIIIIIIIIIIII 

ni406 LTTSLSTLNAPALSRTQSroSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 
130 140 150 160 170 180 

190 200 210 220 230 240 

g406.pep FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

IMIMMIMIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIII 

m406 FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 
190 200 210 220 230 240 

250 260 270 280 290 300 

g406 . pep IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLI^FSDIQPYGNHTGNSAPSVEADN 

MIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIhllllllllllMIIIII 

m406 IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 
250 260 270 280 290 300 



310 320 
g4 0 6 . pep SHEGYGYSDEAVRQHRQGQPX 

Illllllllhllllllllll 
ra4 0 6 SHEGYGYSDEWRQHRQGQPX 
310 320 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 31 13>: 

a406. seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

301 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CGCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACGGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 

701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGACCGTATA AAGTAAGCAA 

801 AGGAATTAAA CCGACAGAAG GATTAATGGT CGATTTCTCC GATATCCAAC 

851 CATACGGCAA TCATATGGGT AACTCTGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC GACATAGACA 

951 AGGGCAACCT TGA 



This corresponds to the amino acid sequence <SEQ ID 3 1 14; ORF 406.a>: 

a406.pep 

1 MQARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 
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251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHMG NSAPSVEADN 
301 SHEGYGYSDE AVRRHRQGQP * 

m406/a406 ORFs 406 and 406. a showed a 98.8% identity in 320 aa overlap 

10 20 30 40 50 60 

m406.pep MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 

IIIINIIIIIIM ) lllltlllllMIIIIIIIIIIMIIlillllM 

a406 MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 
10 20 30 40 50 60 

"70 80 90 100 110 120 

m4 06 . pep KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

M I t I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I t I I ! I I I I I I I 

a4 06 KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

''O 80 90 100 110 120 

130 140 150 160 170 180 

m4 06 . pep LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

Ifllllltllllll lllliMllllillllll IIIIIIMIIilllllll 

a406 LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

130 140 150 160 170 180 

190 200 210 220 230 240 

m4 06 . pep FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

IIINIIMIIIIIIIIIIilllll lllllllllllllllllMlllltlllil 

a406 FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m4 0 6 . pep IKPKTNAFEAAYKENYALWI^GPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 
IIIIIIIMIIIMIIIIIIIIIIIIIIIIMIIIIIIIII|:ill|| IIIIIIIIIM 
a406 IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIQPYGNHMGNSAPSVEADN 
250 260 270 280 290 300 

310 320 
m406 . pep SHEGYGYSDEWRQHRQGQPX 

a4 06 SHEGYGYSDEAVRRHRQGQPX 
310 320 



EXAMPLE 2 
Expression of ORF 919 
The primer described in Table 1 for ORF 919 was used to locate and clone ORF 919. 
The predicted gene 919 was cloned in pET vector and expressed in E, coll The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 919-His fusion protein purification. Mice were immunized with the purified 919- 
ffis and sera were used for Western blot (panel B), FACS analysis (panel C), bactericidal 
assay (panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; PP, 
purified protein, TP, K meningitidis total protein extract; OMV, K meningitidis outer 
membrane vesicle preparation. Arrows indicate the position of the main recombinant protein 
product (A) and the N. meningitidis immunoreactive band (B). These experiments confirm 
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that 919 is a surface-exposed protein and that it is a useful immunogen. The hydrophilicity 
plots, antigenic index, and amphipatic regions of ORF 919 are provided in Figure 10. The 
AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 
143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, 
ScandJ Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 919 and the amino acid 
sequence encoded thereby is provided in Example 1. 

EXAMPLES 
Expression of ORF 279 
The primer described in Table 1 for ORF 279 was used to locate and clone ORF 279. 
The predicted gene 279 was cloned in pGex vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 279-GST purification. Mice were immunized with the purified 279-GST and sera 
were used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay 
(panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. 
meningitidis total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. 
Arrows indicate the position of the main recombinant protein product (A) and the N. 
meningitidis immunoreactive band (B). These experiments confirm that 279 is a surface- 
exposed protein and that it is a usefiil immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 279 are provided in Figure 11 . The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 279 and the amino acid sequence encoded thereby is 
provided in Example 1. 



EXAMPLE 4 
Expression of ORF 576 and 576-1 
The primer described in Table 1 for ORF 576 was used to locate and clone ORF 576. 
The predicted gene 576 was cloned in pGex vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 576- 
GST fusion protein purification. Mice were immunized with the purified 576-GST and sera 
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were used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), 
and ELISA assay (panel E). Symbols; Ml, molecular weight marker; TP, N. meningitidis 
total protein extract; OMV, N, meningitidis outer membrane vescicle preparation. Arrows 
indicate the position of the main recombinant protein product (A) and the K meningitidis 
immunoreactive band (B).. These experiments confirm that ORF 576 is a surface-exposed 
protein and that it is a useful immunogen. The hydrophihcity plots, antigenic index, and 
amphipatic regions of ORF 576 are provided in Figure 12. The AMPHI program is used to 
predict putative T-cell epitopes (Gao et al 1989, J, Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi etal. 1992, Scand J Immunol Suppl 11:9). 
The nucleic acid sequence of ORF 576 and the amino acid sequence encoded thereby is 
provided in Example 1, 



EXAMPLES 
Expression of ORF 519 and 519-1 
The primer described in Table 1 for ORF 519 was used to locate and clone ORF 519, 
The predicted gene 519 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 519- 
ffis fusion protein purification. Mice were immunized with the purified 519-His and sera wer 
used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), and 
ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, A^. meningitidis total 
protein extract; OMV, K meningitidis outer membrane vesicle preparation. Arrows indicate 
the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B). These experiments confirm that 519 is a surface-exposed protein 
and that it is a useful immunogen. The hydrophihcity plots, antigenic index, and amphipatic 
regions of ORF 519 are provided in Figure 13. The AMPHI program is used to predict 
putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, AIDS Res 
Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1 :9). The nucleic 
acid sequence of ORF 519 and the amino acid sequence encoded thereby is provided in 
Example 1. 
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EXAMPLE 6 
Expression ofORF 121 and 121-1 
The primer described in Table 1 for ORF 121 was used to locate and clone ORF 121. 
The predicted gene 121 was cloned in pET vector and expressed in E. colL The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 121- 
His fusion protein purification. Mice were immunized with the purified 121-His and sera wen 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 121 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, A': meningitidis total protein extract; OMV, N. 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the K meningitidis immunoreactive band (B). These 
experiments confirm that 121 is a surface-exposed protein and that it is a usefiil immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 121 are provided in 
Figure 14. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 121 and the amino 
acid sequence encoded thereby is provided in Example 1. 



EXAMPLE? 
Expression of ORF 128 and 128-1 
The primer described in Table 1 for ORF 128 was used to locate and clone ORF 128. 
The predicted gene 128 was cloned in pET vector and expressed in E. colL The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 128- 
His purification. Mice were immunized with the purified 128-His and sera were used for 
Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel D) and 
ELISA assay (panel E). Results show that 128 is a surface-exposed protein. Symbols: Ml, 
molecular weight marker; TP, N. meningitidis total protein extract; OMV, K meningitidis 
outer membrane vesicle preparation. Arrows indicate the position of the main recombinant 
protein product (A) and the A^. meningitidis immunoreactive band (B). These experiments 
confirm that 128 is a surface-exposed protein and that it is a usefiil immunogen. The 
hydrophilicity plots, antigenic index, and amphipatic regions of ORF 128 are provided in 
Figure 15. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 



wo 99/57280 




PCTAJS99/09346 



115 



Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1 992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 128 and the amino 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLES 
Expression of ORF 206 
The primer described in Table 1 for ORF 206 was used to locate and clone ORF 206. 
The predicted gene 206 was cloned in pET vector and expressed in E. colL The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 206- 
His purification. Mice were immunized with the purified 206-His and sera were used for 
Western blot analysis (panel B). It is worthnoting that the immunoreactive band in protein 
extracts from meningococcus is 38 kDa instead of 17 kDa (panel A). To gain information on 
the nature of this antibody staining we expressed ORF 206 in E, coli without the His-tag and 
including the predicted leader peptide. Western blot analysis on total protein extracts fi-om E, 
coli expressing this native form of the 206 protein showed a recative band at a position of 38 
kDa, as observed in meningococcus. We conclude that the 38 kDa band in panel B) is specific 
and that anti-206 antibodies, likely recognize a multimeric protein complex. In panel C is 
shovm the FACS analysis, in panel D the bactericidal assay, and in panel E) the ELISA assay. 
Results show that 206 is a surface-exposed protein. Symbols: Ml, molecular weight marker; 
TP, K meningitidis total protein extract; OMV, K meningitidis outer membrane vesicle 
preparation. Arrows mdicate the position of the main recombinant protein product (A) and the 
K meningitidis immunoreactive band (B). These experiments confirm that 206 is a surface- 
exposed protein and that it is a usefiil immunogen. The hydrophihcity plots, antigenic index, 
and amphipatic regions of ORF 519 are provided in Figure 16, The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, 1 Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1 :9). 
The nucleic acid sequence of ORF 206 and the amino acid sequence encoded thereby is 
provided in Example 1. 
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EXAMPLE 9 
Expression of ORF 287 
The primer described in Table 1 for ORF 287 was used to locate and clone ORF 287, 
The predicted gene 287 was cloned in pGex vector and expressed in E. colL The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 287- 
GST fusion protein purification. Mice were immunized with the purified 287-GST and sera 
were used for FACS analysis (panel B), bactericidal assay (panel C), and ELISA assay (panel 
D). Results show that 287 is a surface-exposed protein. Symbols: Ml, molecular weight 
marker. Arrow indicates the position of the main recombinant protein product (A). These 
experiments confirm that 287 is a surface-exposed protein and that it is a usefiil immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 287 are provided in 
Figure 17. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, 1 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1 992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 287 and the amino 
acid sequence encoded thereby is provided in Example 1. 



EXAMPLE 10 
Expression of ORF 406 
The primer described in Table 1 for ORF 406 was used to locate and clone ORF 406. 
The predicted gene 406 was cloned in pET vector and expressed in E, colL The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 406- 
ffis fiision protein purification. Mice were immunized with the purified 406-His and sera wer< 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Resuhs show that 406 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, K meningitidis total protein extract; OMV, K 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the K meningitidis immunoreactive band (B), These 
experiments confirm that 406 is a surface-exposed protem and that it is a usefiil immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 406 are provided in 
Figure 18. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
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1 992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 406 and the amino 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLE 11 



Table 2 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 225 among different strains. 
Table 2 



225 gene variability: List of used Neisseria strains 


Identification Strains 


Source / reference 


number 




Group B 




zo01_225 NG6/88 


R. Moxon / Seiler et al, 1996 


zo02 225 BZ198 


R. Moxon / Seiler et al, 1996 


zo03 225 NG3/88 


R. Moxon / Seiler et al., 1996 


zo04 225 297-0 


R. Moxon / Seiler et al 1 996 


zo05 225 1000 


R. Moxon / Seiler et al 1 996 


zo06_225 BZ147 


R. Moxon / Seiler et al, 1 996 


zo07 225 BZ169 


R. Moxon / Seiler et al, 1 996 


zo08_225 528 


R. Moxon / Seiler et aL, 1996 


zo09 225 NGP165 


R Moxon / Seiler pt nl 1 QQfs, 


zol0_225 BZ133 


R. Moxon / Seiler et al, 1996 


zoll 225 NGE31 


R. Moxon / Seiler et al, 1996 


zol2 225 NGF26 


R. Moxon / Seiler et al, 1996 


zol3 225 NGE28 


R. Moxon / Seiler et al, 1996 


zol4 225 NGH38 


R. Moxon / Seiler et al, 1996 


zol5 225 SWZ107 


R. Moxon / Seiler et al, 1996 


zol6 225 NGH15 


R. Moxon / Seiler et al, 1996 


zol7 225 NGH36 


R. Moxon / Seiler et al, 1996 


zol8 225 BZ232 


R. Moxon / Seiler et al, 1996 


zol9 225 BZ83 


R. Moxon / Seiler et al, 1996 


zo20 225 44/76 


R, Moxon / Seiler et al, 1996 


zo21 225 MC58 


R, Moxon 


zo96_225 2996 


Our collection 


Group A 




zo22_225 205900 


R. Moxon 


zo23 225 F6124 


R. Moxon 


z249l Z2491 


R. Moxon / Maiden et al, 1998 


Group C 




zo24_225 90/18311 


R. Moxon 


zo25_225 93/4286 


R. Moxon 
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Others 

2o26_225 A22 (group W) R. Moxon / Maiden et aL, 1998 
zo27__225 E26 (group X) R. Moxon / Maiden a/., 1998 
zo28_225 860800 (group Y) R. Moxon / Maiden et aL, 1998 
zo29_225 E32 (group Z) R. Moxon / Maiden et aL, 1998 

Gonococcus 

zo32_225 Ng F62 R. Moxon / Maiden et aL, 1998 

zo33_225 Ng SN4 R. Moxon 

fal090 FA1090 R. Moxon 



References: 



Seiler A. et aL, Mol Microbiol, 1996, 19(4):841-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 

>FA1090 <SEQ ID 3115> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z2491 <SEQ ID 3116> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVP7VRRAGNADELIGNAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSISTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

ZO01_225 <SEQ ID 3117> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO02_225 <SEQ ID 3118> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO03_225 <SEQ ID 3119> 

MDSFFKPAVWAVLWLMFAVRLALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 
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ZO04_225 <SEQ ID 3120> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAP/^RAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO05_225 <SEQ ID 3121> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO06_225 <SEQ ID 3122> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPMRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGJCNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO07_225 <SEQ ID 3123> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO08_225 <SEQ ID 3124> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO09_225 <SEQ ID 3125> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPTVRRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO10_225 <SEQ ID 3126> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPT^^RAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

2011_225 <SEQ ID 3127> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z012_225 <SEQ ID 3128> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z013_225 <SEQ ID 3129> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFIQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z014_225 <SEQ ID 3130> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z015_225 <SEQ ID 3131> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z016_225 <SEQ ID 3132> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z017_225 <SEQ ID 3133> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z018_225 <SEQ ID 3134> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z019_225 <SEQ ID 3135> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO20_225 <SEQ ID 3136> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPINRAPARRAGNADELIGSAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGN/\MGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z021_225 <SEQ ID 3137> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z022_225 <SEQ ID 3138> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGBMQHIFKRAMGINLPRTSAEQARMGTPVAR 
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SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z023_225 <SEQ ID 3139> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z024_225 <SEQ ID 3140> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z025_225 <SEQ ID 3141> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z026_225 <SEQ ID 3142> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG' 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR. 

VKKNDPSRFLN* 

Z027_225 <SEQ ID 3143> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z028_225 <SEQ ID 3144> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 

DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 

SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z029_225 <SEQ ID 3145> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z032_225 <SEQ ID 3146> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z033_225 <SEQ ID 3147> 

MDSFFKPAVWAVLWLMFAVRSALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRIKKNDPSRFLN* 

Z096 225 <SEQ ID 3148> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 



Figure 19 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 225, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 12 



Table 3 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 235 among different strains. 
Tables 



235 gene variability: List of used Neisseria strains 


Identification Strains 


Reference 


number 






Group B 




gnmzqOl 


NG6/88 


Seilere/ al, 1996 


gmnzq02 


BZ198 


Seilere^ al, 1996 


gmnzq03 


NG3/88 


Seller et aL, 1996 


gnnizq04 


1000 


Seller et aL, 1996 


gnmzqOS 


1000 


Seller a/., 1996 


gnmzqO? 


BZ169 


Seller a/., 1996 


granzqOS 


528 


Seller et aL, 1996 


gnnizq09 


NGP165 


Seller a/., 1996 


gnmzqlO 


BZ133 


Seller CM/., 1996 


gnmzqll 


NGE31 


Seller aL, 1996 


gnnizql3 


NGE28 


Seller etaL', 1996 


gmnzql4 


NGH38 


Seller a/., 1996 


gnmzqlS 


SWZ107 


Seller etaL, 1996 


gmnzql6 


NGH15 


Seller etaL, 1996 


gnmzql? 


NGH36 


Seller e/ a/., 1996 


gmnzqlS 


BZ232 


Seller et aL, 1996 


gnni2ql9 


BZ83 


Seller e/ a/., 1996 


gnmzq21 


MC58 


Virji etaL, 1992 


Group A 




gnmzq22 


205900 


Our collection 
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gimizq23 F6124 
Z2491 Z2491 



Our collection 
Maiden e/ a/., 1998 



Group C 

gnmzq24 90/18311 Our collection 
gnmzq25 93/4286 Our collection 

Others 

gnmzq26 A22 (group W) Maiden et al., 1998 
gnmzq27 E26 (group X) Maiden a/., 1998 
gnmzq28 860800 (group Y) Maiden a/., 1998 
gnmzq29 E32 (group Z) Maiden a/., 1998 
gnmzq31 Klactamica Our collection 

Gonococcus 

gnmzq32 Ng F62 Maiden et aL, 1 998 

gnm2q33 NgSN4 Our collection 

fal090 FA1090 Dempsey et al 1991 



References: 



SeilerA. e/a/.,Mol. Microbiol., 1996, 19(4):841-856. 
Maiden R. etal, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 
Virji M. et al, Mol. Microbiol., 1992, 6:1271-1279 
Dempsey J.F, etal, J. Bacteriol., 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3149> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQOl <SEQ ID 3150> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGPCELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ02 <SEQ ID 3151> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ03 <SEQ ID 3152> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
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AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNM2Q04 <SEQ ID 3153> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNG5CELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ05 <SEQ ID 3154> 

MKPLILGIAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ07 <SEQ ID 3155> 

MKPLI LGLAAVLALSACQVQKAP D FD YT S FKESKP AS I L WPPLNE S PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ08 <SEQ ID 3156> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ09 <SEQ ID 3157> 

MKPLILGLAAALVLSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVQPEKLHQIFGNDAVLYITITEYGTS 
YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQIO <SEQ ID 3158> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQll <SEQ ID 3159> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ13 <SEQ ID 3160> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ14 <SEQ ID 3161> 

MKPLILGIAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ15 <SEQ ID 3162> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGKDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ16 <SEQ ID 3163> 
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MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ17 <SEQ ID 3164> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNM2Q18 <SEQ ID 3165> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ19 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ21 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNM2Q22 <SEQ ID 3167> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ23 <SEQ ID 3168> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNM2Q24 <SEQ ID 3169> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LWPPLNE S PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ25 <SEQ ID 3170> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ26 <SEQ ID 3171> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ27 <SEQ ID 3172> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 
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GNMZQ28 <SEQ ID 3173> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ29 <SEQ ID 3174> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ31 <SEQ ID 3175> 

MKPL I LGLAAVLALS ACQVQKAPDFD YTAFKE SKPAS I LWP PLKE S PDVNGTWGMLAST 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITITEYGTS 
YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKAAAYDLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ32 <SEQ ID 317 6> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILVVPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAA\A^ETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGA\A^QIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQ33 <SEQ ID 3177> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

Z2491 <SEQ ID 3178> 

MKPLILGLT^AVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 



Figure 20 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 235, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



Table 4 Usts several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 287 among different strains. 
Table 4 

287 gene variability: List of used Neisseria strains ~ 



EXAMPLE 13 



Identification Strains 
number 



Reference 
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Group B 



287_2 BZ198 

287_9 NGP165 

287_14 NGH38 

287 21 MC58 



Seller a/., 1996 
Seller a/., 1996 
Seller et aL, 1996 
Virji etai, 1992 



Group A 



z2491 



Z2491 



Maiden et al, 1998 



Gonococcus 

fal 090 FAl 090 Dempsey et al 1 99 1 



References; 

Seller A. etal, Mol. Microbiol., 1996, 19(4):841-856. 

Maiden R. et al, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Viiji M. et al, Mol. Microbiol., 1992, 6:1271-1279 

Dempsey J.F. et al, J. Bacteriol., 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



287_14 <SEQ ID 3179> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQNDMPQNAADTDSLTPNHTPAS 
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 
TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKS E FEKLSDADKI SNYKKDGKNDGKNDKFVGLVADS VQMKG INQ Y 1 1 F YKPKPT S FAR 
FRRS ARSRRSLPAEMPLI PVNQADTLI VDGEAVSLTGHSGNI FAPEGNYRYLTYGAEKLP 
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 

DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_2 <SEQ ID 3180> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 

QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQNDMPQKAADTDSLTPNHTPAS 

NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 

TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 

QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR 

FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP 

GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 

DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_21. <SEQ ID 3181> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRS 
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ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 
ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFA/VKVDFGSKS VDGI I DSGD 
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKKEQD* 

287_9 <SEQ ID 3182> 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPWTEDVGEEVLPKEKKDEEA 
VSGAPQADTQDATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADTDS 
STPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAGENAGNTADQA 
ANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKVCDR 
DFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVIIYKDKSAS 
S S SARFRRS AR SRRSLPAEMPLI PVNQADT LI V DGEAVS LTGHSGN I FAPEGN YR YLT YG 
AEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDFGSKS 
VDGIIDSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPT 
DAEKGGFGVFAGKKEQD* 

FA1090 <SEQ ID 3183> 

MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPWAENAGEGVLPKEKKDEEA 

AGGAPQADTQDATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAAESAN 

QTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDSCNGDN 

LLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTDKPPTR 

SARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGS 

YALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAPCVDFGSKSVDGIIDSG 

DDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPTDAEKGGFG 
VFAGKKDRD* 

Z2491 <SEQ ID 3184> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEBCETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYPCKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRS 
ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 
ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGI I DSGD 
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKJCEQD* 



Figure 21 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 287, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



Table 5 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 519 among different strains. 
Tables 

519 gene variability: List of used Neisseria strains ~ 



EXAMPLE 14 



Identification Strains 



Source / reference 
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number 








Group B 




zvOl. 


519 


NG6/88 


R Moxon / Seller pt n1 1 00^ 


zv02_ 


.519 


BZ198 


R Moxnn / Seller />/ 100/^ 


zv03_ 


519ass 


NG3/88 


R Moxon / Seller /»/ 1 QQ^ 


zv04 


'519 


297-0 


R l\4nYon / Sf>ilAr of n1 100A 


zv05. 


.519 


1000 


XV. iviuAon / ociicr 6t u/., lyyo 


zvOe] 


519ass 


BZ147 


R Moxon / Seller e>t nl 1 0QA 

•*-v. ^TAv/Av/xi / kj&iiwi o{ titty X^^yj 


zv07 


'519 


BZ169 


R. Moxon / Seiler a/., 1996 


zvlf 


519 


NGE31 


R. Moxon / Seiler al.^ 1996 


zvl2 


519 


NGF26 


R. Moxon / Seiler et al, 1996 


zvl8_ 


519 


BZ232 


R. Moxon / Seiler et al, 1996 


zvl9_ 


.519 


BZ83 


R. Moxon / Seiler et al, 1996 


zv20 


519ass 


44/76 


R. Moxon / Seiler et aL, 1996 




519ass 


MC58 


R. Moxon 


zv96_ 


519 


2996 


Our collection 



Group A 

zv22_519ass 205900 R. Moxon 

z2491_519 Z2491 R. Moxon / Maiden et al, 1998 

Others 

zv26_519 A22 (group W) R. Moxon / Maiden et aL, 1998 
zv27_519 E26 (group X) R. Moxon / Maiden et aL, 1998 
zv28_519 860800 (group Y) R. Moxon / Maiden et aL, 1998 
zv29_5 1 9ass E32 (group Z) R. Moxon / Maiden et aL, 1998 

Gonococcus 

zv32_519 Ng F62 R. Moxon / Maiden et aL, 1998 

fal090_519 FA1090 R. Moxon 



References; 

Seiler A. et aL, Mol. Microbiol., 1996, 19(4):841-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090_519 <SEQ ID 3185> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 
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Z2491_519 <SEQ ID 3186> 

MEFFI ILLAAWVFGFKSFWI PQQEVHWERLGRFHRALTAGLNI LI PFI DRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

2V01_519 <SEQ ID 3187> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIT^SEGRKIEQINUVSGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV02_519 <SEQ ID 3188> 

ME FFI I LLVAVAVFG FKS FWI PQQE VH WERLGRFHRALTAGLN I L I P FI DRVAYRHS L 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV03_519 <SEQ ID 3189> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEXKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV04_519 <SEQ ID 3190> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV05_519 <SEQ ID 3191> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGMAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV06_519ASS <SEQ ID 3192> 

ME FFI I LLVAVAVFGFKS FWI PQQE VHWERLGRFHRALT AGLN I L I P FI DRVAYRHS L 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTVFSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERK 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV07_519 <SEQ ID 3193> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQRE7VEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVMANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 
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ZV11_519 <SEQ ID 3194> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV12_519 <SEQ ID 3195> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV18_519 <SEQ ID 3196> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDtJTQLTVDGIXYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV19_519 <SEQ ID 3197> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEIKSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV20_519ASS <SEQ ID 3198> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSM 
ISAGMKIIDSSKTAK* 

ZV21_519ASS <SEQ ID 3199> 

ME FFI I LLVAVAV FGFKS FWI PQQEVH WERLGRFHRALTAGLN I L I PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE INST WAALDEAAGAWG VKVLRYE I KDLVP PQE I LRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAKANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I S AGMKI I DS SKTAK* 

ZV22_519ASS <SEQ ID 3200> 

ME FFI I LLAAVWFGFKS FWI PQQEVH WERLGRFHRALTAGLN I LI PFI DRVAYRHS L 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKT FEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAKIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV26_519 <SEQ ID 3201> 

ME FFI I LLAAWVFGFKS FWI PQQEVH WERLGRFHRALTAGLN I L I PFI DRVAYRH S L 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKT FEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV27_519 <SEQ ID 3202> 
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MEFPIILLVAVAVFGFKSFWIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV28_519 <SEQ ID 3203> 

ME FFI ILLAAVAVFGFK.S FWI PQQE VH WERLGRFHRALTAGLN I LI PFI DRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I S AGMKI IDS SKT AK* 

ZV29_519ASS <SEQ ID 3204> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSIWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREPEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSNKTAK* 

ZV32_519 <SEQ ID 3205> 

MEFFI ILLAAVAVFGFKS FWI PQQEVHWERLGRFHRALTAGLN IL I PFI DRVAYRHS L 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI IDS SKTAK* 

ZV96_519 <SEQ ID 3206> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 



Figure 22 shows the results of aligning the sequences of each of these strains. Dark 
shading mdicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 519, further confirming its utiHty as an antigen for both 
vaccines and diagnostics. 



Table 6 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 919 among different strains. 
Table 6 

919 gene variability: List of used Neisseria strains ~ 



EXAMPLE 15 



Identification Strains 



Source / reference 
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Group B 






zmOl 


NG6/88 


R. Moxon / Seller et qL^ 


1996 


zm02 


BZ198 


R. Moxon / Seller et aLy 


1996 


zmOS 


NG3/88 


R. Moxon / Seller et aL, 


1996 


zm04 


297-0 


R. Moxon / Seller et qL, 


1996 


zm05 


1000 


R. Moxon / Seller et aLy 


1996 


zm06 


BZ147 


R. Moxon / Seller et aLy 


1996 


zm07 


BZ169 


R. Moxon / Seller et aLy 


1996 


zm08n 528 


R. Moxon / Seller et aLy 


1996 


zm09 


NGP165 


R. Moxon / Seller et aLy 


1996 


zmlO 


BZ133 


R. Moxon / Seller ei aLy 


1996 


zmllasbc NGE31 


R. Moxon / Seller et aLy 


1996 


zml2 


NGF26 


R. Moxon / Seller et aLy 


1996 


zml3 


NGE28 


R. Moxon / Seller et aL, 


1996 


zml4 


NGH38 


R. Moxon / Seller et al 


1996 


anl5 


SWZ107 


R. Moxon / Seller et aLy 


1996 


zml6 


NGH15 


R. Moxon / Seller et aLy 


1996 


zml7 


NGH36 


R. Moxon / Seller et aLy 


1996 


zmlS 


BZ232 


R. Moxon / Seller et aLy 


1996 


2ml9 


BZ83 


R. Moxon / Seller et aLy 


1996 


zin20 


44/76 


R. Moxon / Seller et aLy 


1996 


zm21 


MC58 


R. Moxon 




zin96 


2996 


Our collection 





Group A 

zm22 205900 
zni23asbc F6124 
Z2491 Z2491 



R. Moxon 
R. Moxon 

R. Moxon / Maiden et aLy 1998 



Group C 

2m24 90/18311 
2m25 93/4286 



R. Moxon 
R. Moxon 



Others 

2m26 A22 (group W) R. Moxon / Maiden et aLy 1998 
zm27bc E26 (group X) R. Moxon / Maiden et aLy 1998 
zm28 860800 (group Y) R. Moxon / Maiden et aLy 1998 
zni29asbc E32 (group Z) R. Moxon / Maiden et aLy 1998 
zm31asbc N. lactamica R. Moxon 

Gonococcus 

zm32asbc Ng F62 R. Moxon / Maiden et aLy 1998 

zm33asbc Ng SN4 R. Moxon 



fal090 FA1090 R. Moxon 
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References; 

Seller A. et al, MoL Microbiol, 1996, 19(4):841-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3207> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YT WPHLSMPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT P VHS FQAKRFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

Z2491 <SEQ ID 3208> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZMOl <SEQ ID 3209> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YT WPHLS L PHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHS FQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM02 <SEQ ID 3210> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM03 <SEQ ID 3211> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGFVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM04 <SEQ ID 3212> 
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MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM05 <SEQ ID 3213> 

MKKYLFRAALYGITWVILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLSCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGI PDDFI S VPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

2M06 <SEQ ID 3214> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM07 <SEQ ID 3215> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM08N <SEQ ID 3216> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM09 <SEQ ID 3217> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 

YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHS FQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZMIO <SEQ ID 3218> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFmilQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
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KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZMllASBC <SEQ ID 3219> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSXGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM12 <SEQ ID 3220> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM13 <SEQ ID 3221> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAEQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM14 <SEQ ID 3222> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSRNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM15 <SEQ ID 3223> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDLAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNHQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM16 <SEQ ID 3224> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPGRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM17 <SEQ ID 3225> 
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MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFETiHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM18 <SEQ ID 3226> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

2M19 <SEQ ID 3227> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDF7VKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFETdHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNFSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM20 <SEQ ID 3228> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM21 <SEQ ID 3229> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

2M22 <SEQ ID 3230> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM23ASBC <SEQ ID 3231> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
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KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTSKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPEYRP* 

ZM24 <SEQ ID 3232> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTG YVWQLL PNGMKPE YRP * 

ZM25 <SEQ ID 3233> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGBCYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM26 <SEQ ID 3234> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRIAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM27BC <SEQ ID 3235> 

MKKYLFRAALYGISAAILAACQSKSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFT PWQVAGNGSLAGT VTG YYE PVLKGDDRRTAQARFP lYGIPDDFI S VPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQXNGGAL 

DGKAPILGYAEDPVELFETiHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPEYRP* 

ZM28 <SEQ ID 3236> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM29ASBC <SEQ ID 3237> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDF/UCSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATTHPITRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM31ASBC <SEQ ID 3238> 
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MKKHLFRAALYGIAAAILAACQSKSIQTFPQPDTSIIKGPDRPAGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMRQNPQRIAEVLGQNPSYVFFRELAGSGNDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM32ASBC <SEQ ID 3239> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGGDGPVGALGTPLMGGYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM33ASBC <SEQ ID 324 0> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 

YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPIHSFQAKRFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 

LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKSYMRQNPHKLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 

IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM96 <SEQ ID 3241> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSKDGPVGALGTPLMGEYAGA 

VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



Figure 23 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 919, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



. EXAMPLE 16 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PGR) assay in order to clone the ORFs as 
indicated: 



Table 7: Oligonucleotides used for PGR to ampHfy complete or partial ORFs 
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ORF primer Sequence 

001 Forward CGCGGATCCCATATG-TGGATGGTGCTGGTCAT 



Restriction 
sites 



Reverse CCCGCTCGAG-TGCCGTCTTGTCCCAC 
003 Forward CGC GGATCCCATATG -GTCGTATTCGTGGC 

Reverse CCCGCTCGAG-AAAATCATGAACACGCGC 

005 Forward CGCGGATCCCATATG-GACAATATTGACATGT 

Reverse CCCGCTCGAG-CATCACATCCGCCCG 

006 Forward CGC GGATCCCATATG -CTGCTGGTGCTGG 

Reverse CCC GCTCGAG -AGTTCCGGCTTTGATGT 

007 Forward CGCGGATCCCATATG-GCCGACAACAGCATCAT 

Reverse CCCGCTCGAG-AAGGCGTTCATGATATAAG 

008 Forward CGCGGATCCCATATG-AACAACAGACATTTTG 

Reverse CCCGCTCGAG-CCTGTCCGGTAAAAGAC 

009 Forward CGC GGATCCCATATG -CCCCGCGCTGCT 

Reverse CCCG CTCGAG -TGGCTTTTGCCACGTTTT 

011 Forward CGCGGATCCCATATG-AAGACACACCGCAAG 

Reverse CCC GCTCGAG -GGCGGTCAGTACGGT 

012 Forward CGC GGATCCCATATG -CTCGCCCGTTGCC 

Reverse CCCGCTCGAG-AGCGGGGAAGAGGCAC 

013 Forward CGCGGATCCCATATG-CCTTTGACCATGCT 

Reverse CCCGCTCGAG-CTGATTCGGCAAAAAAATCT 

018 Forward CGC GGATCCCATATG -CAGCAGAGGCAGTT 

Reverse CCC GCTCGAG -GACGAGGCGAACGCC 

019 Forward AAAGAATTC-CTGCCAGCCGGCAAGACCCCGGC 
Reverse AAACTGCAG-TCAGCGGGCGGGGACAATGCCCAT 

023 Forward AAAGAATTC-AAAGAATATTCGGCATGGCAGGC 
Reverse AAACTGCAG-TTACCCCCAAATCACTTTAACTGA 

025 Forward AAAGAATTC-TGCGCCACCCAACAGCCTGCTCC 
Reverse AAACTGCAG-TCAGAACGCGATATAGCTGTTCGG 

031 Forward CGCGGATOCCATATG-GTCTCCCTTCGCTT 

Reverse CCCGCTCGAG-ATGTAAGACGGGGACAAC 

032 Forward CGCGGATCCCATATG-CGGCGAAACGTGG 



BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 
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Reverse CCCGCTCGAG-CTGGTTTTTTGATATTTGTG 

033 Forward CGCGGATCCCATATG-GCGGCGGCAGACA 

Reverse CCCGCTCGAG-ATTTGCCGCATCCCGAT 

034 Forward CGCGGATCCCATATG-GCCGAAAACAGCTACGG 

Reverse CCC GCTCGAG -TTTGACGATTTGGTTCAATT 
036 Forward CGCGGATCCCATATG-CTGAAGCCGTGCG 

Reverse CCCGCTCGAG-CCGGACTGCGTATCGG 

038 Forward CGCGGATCCCATATG-ACCGATTTCCGCCA 

Reverse CCCGCTCGAG-TTCTACGCCGTACTGCC 

039 Forward CGCGGATCCCATATG-CCGTCCGAACCGC 

Reverse CCCGCTCGAG-TAGGATGACGAGGTAGG 

041 Forward CGC GGATCCCATATG -TTCGTGCGCGAACnnr 

Reverse CCCGCTCGAG-GCCCAAAAACTCTTTCAAA 

042 Forward CGC GGATCCCATATG -ACGATGATTTGCTTGr 

Reverse CCCGCTCGAG-TTTGCAGCCTGCATTTGAC 

043 Forward AAAAAAGGTACC-ATGGTTGTTTCAAATCAAAATATC 

Reverse AAACTGCAG-TTATTGCGCTTCACCTTCCGCCGC 

043a Forward AAAAAAGGTACC:GCAAAAGTGCATGGCGGCTTGGACGGTGC 
Reverse AAAAA ACTGCAG - 

TTAATCCTGCAACACGAATTCGCCCGTCCG 

044 Forward CGCGGATCCCATATG-CCGTCCGACTAGAG 



046 



047 



048 



049 



050 



Reverse CCCGCTCGAG-ATGCGCTACGGTAGCCA 
Forward AAAGAATTC-ATGTCGGCAATGCTCCCGACAAG 
Reverse AAACTGCAG-TCACTCGGCGACCCACACCGTGAA 
Forward CGCGGATCCCATATG-GTCATCATACAGGCG 

Reverse CCC GCTCGAG -TCCGAAAAAGCCCATTTTG 
Forward AAAGAATTC-ATGCTCAACAAAGGCGAAGAATTGCC 
Reverse AAACTGCAG-TCAAGATTCGACGGGGATGATGCC 
Forward AAAGAATTC-ATGCGGGCGCAGGCGTTTGATCAGCC 
Reverse AAACTGCAG-AAGGCGTATCTGAAAAAATGGCAG 
Forward CGC GGATCCCATATG -GGCGCGGGGTGrr 



Reverse CCCGCTCGAG-AATCGGGCCATCTTCGA 
052 Forward AAAAAAGAATTC-ATGGCTTTGGTGGCGGAGGAAAC 
Reverse AAAAAAGTCGAC-TCAGGCGGCGTTTTTCACCTTCCT 
052a Forward AAAAAAGAATTC-GTGGCGGAGGAAACGGAAATATCCGC 



Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BaraHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

Kpnl 

PstI 

Kpnl 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

Sail 

EcoRI 



wo 99/57280 



142 



PCT/US99/09346 



Reverse AAAAAACTGCAG-TTAGCTGTTTTTGGAAACGCCGTCCAACCC Pst I 

073 Forward CGC GGATCCCATATG -TGTATGCCATATAAGAT BamHI- 

Ndel 

Reverse CCC GCTCGAG -CACCGGATTGTCCGAC Xhol 
075 Forward CGC GGATCCCATATG -CCGTCTTACTTCATC BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATCACCAATGCCGATTATTT xhol 
077a Forward AAAAAAGAATO-GGCGGCATTTTCATCGACACCTTCCT Eco RI 

Reverse AAAAAACTGCAG-TCAGACGAACATCTGCACAAACGCAAT Pst I 

080 Forward AAAGAATTC-GCGTCCGGGCTGGTTTGGTTTTACAATTC EcoRI 
Reverse AAACTGCAG-CTATTCTTCGGATTCTTTTTCGGG Pst I 

081 Fonvard AAAGAATTC-ATGAAACCACTGGACCTAAATTTCATCTG EcoRI 
Reverse AAACTGCAG-TCACTTATCCTCCAATGCCTC PstI 

082 Forward AAAGAATTC-ATGTGGTTGTTGAAGTTGCCTGC EcoRI 
Reverse AAACTGCAG-TTACGCGGATTCGGCAGTTGG PstI 

084 Forward AAAGAATTC-TATCACCCAGAATATGAATACGGCTACCG EcoRI 
Reverse AAACTGCAG-TTATACTTGGGCGCAACATGA Pst I 

085 Forward CGCGGATCCCATATG-GGTAAAGGGCAGGACT BamHI- 

Ndel 

Reverse CCCGCTCGAG-CAAAGCCTTAAACGCTTCG Xhol 

086 Forward AAAAAAGGTACC-TATTTGGCATCAAAAGAAGGCGG Kpnl 
Reverse AAACTGCAG-TTACTCCACCCGATAACCGCG PstI 

087 Forward AAAGAMTC-ATGGGCGGTAAAACCTTTATGC EcoRI 
Reverse AAACTGCAG-TTACGCCGCACACGCAATCGC PstI 

087a Forward AAAAAAGAATTC-AAGCTATTAGGCGTGCCGATTGTGATTCA EcoRI 
Reverse AAAAAACTGCAG-TTACGCCTGCAAGATGCCCAGCTTGCC PstI 

088 Forward AAAAAAGAATTC-ATGTTTTTATGGCTCGCACATTTCAG Eco RI 
Reverse AAAAAACTGCAG-TCAGCGGATTTTGAGGGTACTCAAACC PstI 

089 Forward CGC GGATCCCATATG -CCGCCCAAAATCAr BamHI- 

Ndel 

Reverse CCCGCTCGAG-TGCGCATACCAAAGCCA Xhol 

090 Forward CGCGGATCCCATATG-CGCATAGTCGAGCA BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGCAAAACGGCGGTACG Xhol 

091 Forward AAAGAATTC-ATGGAAATAGCCGTACCGCCGAGTCC Eco RI 
Reverse AAACTGCAG-TCAGCGCAGGGGGTAGCCCAAGCC PstI 

092 Forward AAAGAAnC-ATGTTTTTTATTTCAATCCG Eco RI 
Reverse AAACTGCAG-TCAAATCTGTTTCGACAATGC Pst I 

093 Forward AAAGAATTC-ATGCAGAATTTTGGCAAAGTGGC Eco RI 
Reverse AAACTGCAG-CTATGGCTCGTCATACCGGGC PstI 

094 Forward AAAGAATTC-ATGCCGTCACGGAAGCGCATCAACTC Eco RI 
Reverse AAACTGCAG-TTATCCCGGCCATACCGCCGAACA PstI 

095 Forward AAAGAAnC-ATGTCCTTTCATTTGAACATGGACGG EcoRI 
Reverse AAACTGCAG-TCAACGCCGCAGGCACTAACGCCC PstI 

096 Forward AAAGAATTC-ATGGCTCGTCATACCGGGCAGGG Eco RI 
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Reverse AAACTGCAG-TCAAAGGAAAAGGCCGTCTGAAAAGCG PitI — 

097 Forward AAAGAATTC-ATGGACACTTCAAAACAAACACTGTTG EcoRI 
Reverse AAACTGCAG-TCAGCCCAAATACCAGAATTTCAG PstI 

098 Forward AAAGAATTC-GATGAACGCAGCCCAGCATGGATACG EcoRI 
Reverse AAACTGCAG-TTACGACATTCTGATTTGGCA PstI 

102 Forward AAAAAAGAATTC-GGCCTGATGATTTTGGAAGTCAACAC EcoRI 

Reverse AAAAAACTGCAG-TTATCCTTTAAATACGGGGACGAGTTC PstI 

105 Forward CGCGGATCCCATATG-TCCGCAAACGAATACG BamHI- 

Ndel 

Reverse CCCGCTCGAG-GTGTTCTGCCAGTTTCAG Xhol 

107 Forward AAAAA AGAATTC - Eco RI 

CTGATGATTTTGGAAGTCAACACCCATTATCC 

Reverse AAAAAACTGCAG-TTATCCTTTAAATACGGGGACGAGTTC PstI 

107b Forward AAAAA AGAATTC - EcoRI 

GATACCCAAGCCCCCGCCGGCACAAACTACTG 
Reverse AAAAAA CTGCAG - Psti 

TTACGCGTCGCCTTTAAAGTATTTGAGCAGGCTGGAGAC 

108 Forward AAAGAATTC-ATGTTGCCGGGCTTCAACCG Eco RI 
Reverse AAACTGCAG-TTAGCGGTACAGGTGTTTGAAGCA PstI 

108a Forward AAAAAAGAATTC-GGTAACACATTCGGCAGCTTAGACGGTGG Eco RI 
Reverse AAACTGCAG-TTAGCGGTACAGGTGTTTGAAGCA PstI 

109 Forward AAAGAATTC-ATGTATTATCGCCGGGTTATGGG Eco Rl 
Reverse AAACTGCAG-CTAGCCCAAAGATTTGAAGTGTTC PstI 

111 Forward CGCGGATCCCATATG-TGTTCGGAACAAACCGC BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCGGAGCAGTTTTTCAAA xhol 
114 Forward CGC GGATCCCATATG -GCTTCCATrArTTmr BamHI- 

Ndel 

Reverse CCCGCTCGAG-CATCCGCGAAATCGTC Xhol 

117 Forward AAAAAAGGTACC-ATGGTCGAAGAACTGGAACTGCTG Kpnl 
Reverse AAACTGCAG-TTAAAGCCGGGTAACGCTCAATAC PstI 

118 Forward AAAGTCGACATGTGTGAGTTCAAGGATATTATAAG Sail 
Reverse AAAGCATGC-CTATTTTTTGTTGTAATAATCAAATC SphI 

121 Forward CGC GGATCCCATATG -GAAACACAGCTTTArAT BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATAATAATATCCCGCGCCC Xhol 

122 Forward CGCGGATCCCATATG-GTCATGATTAAAATCCGCA BamHI- 

Ndel 

Reverse CCCGCTCGAG-AATCTTGGTAGATTGGATTT xhol 

125 Forward AAAGAATTC-ATGTCGGGCAATGCCTCCTCTCC Eco RI 
Reverse AAACTGCAG-TCACGCCGTTTCAAGACG PstI 

125a Forward AAAAAAGAATTC-ACGGCAGGCAGCACCGCCGCACAGGTTTC Eco RI 
Reverse AAAAA ACTGCAG - Pgjj 

TTATTTTGCCACGTCGGTTTCTCCGGTGAACAACGC 

126 Forward CGCGGATCCCATATG-CCGTCTGAAACCC BamHI 



# 
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Ndel 

Reverse CCCGCTCGAG-ATATTCCGCCGAATGCC Xhol 

127 Forward AAAGAATTC-ATGGAAATATGGAATATGTTGGACACTTG EcoRI 
Reverse AAACTGCAG-TTAAAGTGTTTCGGAGCCGGC PstI 

127a Forward AAAAAAGAATTC-AAGGAACTGATTATGTGTCTGTCGGG Eco RI 

Reverse AAACTGCAG-TTAAAGTGTTTCGGAGCCGGC PstI 

128 Forward CGCGGATCCCATATG-ACTGACAACGCACT BamHI- 

Ndel 

Reverse CCCGCTCGAG-GACCGCGTTGTCGAAA xhol 

130 Forward CGCGGATCCCATATG-AAACAACTCCGCGA BamHI- 

Ndel 

Reverse CCCGCTCGAG-GAATTTTGCACCGGATTG Xhol 

132 Forward AAAGAATTC-ATGGAACCCTTCAAAACCTTAATTTG Eco RI 

Reverse AAAAAACTGCAG-TCACCATGTCGGCATTTGAAAAAC PstI 

134 Forward CGCGGATCCCATATG-TCCCAAGAAATCCTC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CAGTTTGACCGAATGTTC Xhol 

135 Forward CGC GGATCCCATATG -AAATACAAAAGAATrrrTATT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AAATTCGGTCAGAAGCAGG Xhol 

137 Forward AAAAAAGGTACC-ATGATTACCCATCCCCAATTCGATCC Kpnl 
Reverse AAAAAACTGCAG-TCAGTGCTGTTTTTTCATGCCGAA PstI 

137a Forward AAAAAAGAATTC-GGCCGCAAACACGGCATCGGCTTCCT EcoRI 

Reverse AAAAAACTGCAG-TTAAGCGGGATGACGCGGCAGCATACC PstI 

138 Forward AAAAAAGAiOTC-AACTCAGGCGAAGGAGTGCTTGTGGC EcoRI 
Reverse AAAAAATCTAGA:TCAGTTTAGGGATAGCAGGCGTAC Xbal 

141 Forward AAAGAATTC-ATGAGCTTCAAAACCGATGCCGAAATCGC EcoRI 
Reverse AAACTGCAG-TCAGAACAAGCCGTGAATCACGCC PstI 

142 Forward CGCGGATCCCATATG-CGTGCCGATTTCATG BamHI- 

Ndel 

Reverse CCCGCTCGAG-AAACTGCTGCACATGGG Xhol 

143 Forward AAAAA AGAATTC - Eco RI 

ATGCTCAGTTTCGGCTTTCTCGGCGTTCAGAC 

Reverse AAAAAACTGCAG-TCAAACCCCGCCGTGTGTTTCTTTAAT PstI 

144 Forward AAAAAAGAATTC-GGTCTGATCGACGGGCGTGCCGTAAC Eco RI 
Reverse AAAAAATCTAGA-TCGGCATCGGCCGGCATATGTCCG Xbal 

146 Forward AAAAA AGAATTC - Eco RI 

CGCCAAGTCGTCATTGACCACGACAAAGTC 

Reverse AAAAAACTGCAG-TTAGGCATCGGCAAATAGGAAACTGGG PstI 

147 Forward AAAAAAGAATTC-ACTGAGCAATCGGTGGATTTGGAAAC EcoRI 
Reverse AAAAAATCTAGA-TTAGGTAAAGCTGCGGCCCATTTGCGG Xbal 

148 Forward AAAAA AGAATTC - Eco RI 

ATGGCGTTAAAAACATCAAACTTGGAACACGC 

Reverse AAAAAATCTAGA-TCAGCCCTTCATACAGCCTTCGTTTTG Xba I 

149 Forward CGCGGATCCCATATG-CTGCTTGACAArAAAnT BamHI- 
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Reverse CCCGCTCGAG-AAACTTCACGTTCACGCC 
Forward CGCGGATCCCATATG-CAGAACACAAATCCG 



Reverse CCCGCTCGAG-ATAAACATCACGCTGATAGC 

151 Forward AAAAA AGAATTC - 

ATGAAACAAATCCGCAACATCGCCATCATCGC 
Reverse AAAAAACTGCAG-TCAATCCAGCTTTTTAAAGTGGCGGCG 

152 Forward AAAAA AGAATTC - 

ATGAAAAACAAAACCAAAGTCTGGGACCTCCC 
Reverse AAAAAACTGCAG-TCAGGACAGGAGCAGGATGGCGGC 

153 Forward AAAAAAGAATTC-ATGGCGTTTGCTTACGGTATGAC 

Reverse AAAAAACTGCAG-TCAGTCATGTTTTTCCGTTTCATT 

153a Forward AAAAAAGAiVITC-CGGACTTCGGTATCGGTTCCCCAGCATTG 
Reverse AAAAA ACTGCAG - 

TTACGCCGACGAAATACTCAGACmrCGG 

154 Forward CGCGGATCCCATATG-ACTGACAACAGCCC 



155 



156 



157 



Reverse CCCGCTCGAG-TCGGCTTCCTTTCGGG 
Forward AAAAAAGAATTC-ATGAAAATCGGTATCCCACGCGAGTC 
Reverse AAAAAACTGCAG-TTACCCTTTCTTAAACATATTCAGCAT 
Forward AAAAAAGAATTC-GCACAGCAAAACGGTTtTGAAGC 
Reverse AAAAAACTGCAG-TCAAGCAGCCGCGACAAACAGCCC 
Forward CGCGGATCCCATATG-AGGAACGAGGAAAAAC 



Reverse CCCGCTCGAG-AAAACACAATATCCCCGC 
158 Forward AAAAAAGAATTC-GCGGAGCAGTTGGCGATGGCAAATTCTGC 

Reverse AAAAAATCTAGA-TTATCCACAGAGATTGTTTCCCAGTTC 
160 Forward CGCGGATCCCATATG-GACATTCTGGACAAAC 

Reverse CCC GCTCGAG -TTTTTGCCCGCCTTCTTT 

163 Forward AAAAAAGGTACC-ACCGTGCCGGATCAGGTGCAGATGTG 
Reverse AAAAAATCTAGA-TTACTCTGCCAATTCCACCTGCTCGTG 

163a Forward AAAAAAGAATTC-CGGCTGGTGCAGATAATGAGCCAGAC 
Reverse AAAAAATCTAGA-TTACTCTGCCAATTCCACCTGCTCGTG 

164 Forward CG CGGATCCCATATG -AACCGGACTTATOrr 

Reverse CCCGCTCGAG-TTTGTTTCCGTCAAACTGC 

165 Forward CGCGGATOCGCTAGC-GCTGAAGCGACAGACG 

Reverse CCCGCTCGAG-AATATCCAATACTTTCGCG 
206 Forward CGCGGATCCCATATG-AAACACCGCCAACCGA 

Reverse CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC 
209 Forward CGCGGATCCCATATG-CTGCGGCATTTAGGA 



Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI . 
EcoRI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

Xbal 

BamHI- 

Ndel 

Xhol 

Kpnl 

Xbal 

EcoRI 

Xbal 

BamHI- 

Ndel 

Xhol 

BamHI- 

Nhel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 
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Reverse CCC GCTCGAG -TACCCCTGAAGGCAAC JOioi — 

211 Forward AAAAA AGAATTC -ATGTTGCGGGTTGCTGCTGC EcoRI 
Reverse AAAAAACTGCAG-CTATCCTGCGGATTGGCATTGAAA PstI 

212 Forward CG CGGATCCCATATG -GACAATCTCGTATGG BamHI- 

Ndel 

Reverse CCC GCTCGAG -AGGGGTTAGATCCTTCC Xhol 

215 Forward CGC GGATCCCATATG -GCATGGTTGGGTCGT BamHI- 

Ndel 

Reverse CCCGCTCGAG-CATATCTTTTGTATCATAAATC Xhol 

216 Forward CG CGGATCCCATATG -GCAATGGCAGAAAACG BamHI- 

Ndel 

Reverse CCC GCTCGAG -TACAATCCGTGCCGCC Xhol 

217 Forward CG CGGATCCCATATG -GCGGATGACGGTGTG BamHI- 

Ndel 

Reverse CCCGCTCGAG-ACCCCGAATATCGAATCC Xhol 

218 Forward CGC GGATCCCATATG -GTCGCGGTCGATC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TAACTCATAGAATCCTGC Xhol 

219 Forward CGC GGATCCGCTAGC -ACGGCAAGGTTAAG BamHI- 

Nhel 

Reverse CCCGCTCGAG-TTTAAACCATCTCCTCAAAAC Xhol 
223 Forward CGCGGATCCCATATG-GAATTCAGGCACCAAGTA BamHI- 

Ndel 

Reverse CCCGCTCGAG-GGCTTCCCGCGTGTC Xhol 

225 Forward CGCGGATCCCATATG-GACGAGTTGACCAACC BamHI- 

Ndel 

Reverse CCC GCTCGAG -GTTCAGAAAGCGGGAC Xhol 

226 Forward AAAGAATTC-CTTGCGATTATCGTGCGCACGCG Eco RI 
Reverse AAACTGCAG-TCAAAATCCCAAAACGGGGAT PstI 

228 Forward CGCGGATCCCATATG-TCGCAAGAAGCCAAACAG BamHI- 

Ndel 

Reverse CCC GCTCGAG- TTTGGCGGCATCTTTCAT xhol 

229 Forward CGC GGATCCCATATG -CAAGAGGTTTTGCCC: BamHI- 

Ndel 

Reverse CCCGCTCGAG-ACACAATATAGCGGATGAAC Xhol 

230 Forward CGCGGATCCCATATG-CATCCGGGTGCCGAC BamHI- 

Ndel 

Reverse CCCGCTCGAG-AAGTTTGGCGGCTTCGG Xhol 

232 Forward AAAAAAGAATTC-ATGTACGCTAAAAAAGGCGGTTTGGG Eco RI 
Reverse AAAAAACTGCAG-TCAAGGTTTTTTCCTGATTGCCGCCGC " PstI 

232a Forward AAAAAAGAATTC-GCCAAGGCTGCCGATACACAAATTGA Eco RI 
Reverse AAAAAACTGCAG-TTAAACATTGTCGTTGCCGCCCAGATG PstI 

233 Forward CGC GGATCCCATATG -GCGGACAAACCC A AG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GACGGCATTGAGCAG Xhol 

234 Forward CGCGGATCCCATATG-GCCGTTTCACTGACCG BamHI- 
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Ndel 




Reverse GCCCAAGCTT-ACGGTTGGATTGCCATG 


Hind m 


235 


Forward CGCGGATCCCATATG-GCCTGCCAAGTTCAAA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTGGGCTGCTCTTC 


Xhol 


236 


Forward CGCGGATCCCATATG-GCGCGTTTCGCCTT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-ATGGGTCGCGCGCCGT 


Xhol 


238 


Forward CGCGGATCCGCTAGC-AACGGTTTGGATGCCCn 


BamHI- 






Nhel 




Reverse CCCGCTCGAG-TTTGTCTAAGTTCCTGATATG 


Xhol 


239 


Forward CCGGAATTCTACATATG-CTCCACCATAAAGGTATTG 


EcoRI- 






Ndel 




Reverse CCCGCTCGAG-TGGTGAAGAGCGGTTTAG 


Xhol 


240 


Forward CGCGGATCCCATATG-GACGTTGGACGATTTr 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AAACGCCATTACCCGATG 


Xhol 


241 


Forward CCGGAATTCTACATATG-CCAACACGTCCAACT 


EcoRI- 






Ndel 




Reverse CCCGCTCGAG-GAATGCGCCTGTAATTAATC 


Xhol 


242 


Forward CGCGGATCCCATATG-ATCGGCAAACTTGTTG 


BamHI- 






Ndel 




Reverse GCCCAAGCTT-ACCGATACGGTCGCAG 


Hindlll 


243 


Forward CGCGGATCCCATATG-ACGATTTTTTCGATGTTrTr 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CGACTTGGTTACCGCG 


Xhol 


244 


Forward CGCGGATCCCATATG-CCGTCTGAAGCCC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTTTTCGGTAGGGGATTT 


Xhol 


246 


Forward CGCGGATCCCATATG-GACATCGGCAGTGr 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CCCGrGrrnrTrirrAf; 


Xhol 


247 


Forward CGCGGATCCCATATG-GTCGGATCGAGTTAC 


BamHI- 






Ndel 




Reverse CCCGCTCCwAO- A A nTCTTTnTrvrTrnnnn a 


Xhol 


248 


Forward CGCGGATCCCATATG-CGCAAACAGAArArT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CTCATCATTATTGCTAACA 


Xhol 


249 


Forward CGCGGATCCCATATG-AAG A AT A ATf; A TmrTrr 


BamHI- 




Reverse CCCGCTCGAG-TTCCCGACCTCCGAC 


Ndel 
Xhol 


251 


Forward CGCGGATCCCATATG-CGTGCTGCGGTAGT 


BamHI- 




Reverse CCCGCTCGAG-TACGAAAGCCGGTCGTG 


Ndel 
Xhol 


253 


Forward AAAAAAGAATTC-ATGATTGACAGGAArnnTATnrTGrn 


EcoRI 




Reverse AAAAAACTGCAG-TTATTGGTCTTTCAAACGCCCTTCCTG 


PstI 
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253a Forward AAAAA AGAATTC -AAAATCCTTTTGAAAACAAGCGAAAAnrTn F.r.n RT 
Reverse AAAAAACTGCAG-TrATTGGTCTTTCAAACGCCCTTCCTG PstI 

254 Forward AAAAAAGAATTC-ATGTATACAGGCGAACGCTTCAATAC Eco RI 
Reverse AAAAAATCTAGA-TCAGATTACGTAACCGTACACGCTGAC Xbal 

255 Forward CG CGGATCCCATATG -GCCGCGTTGCGTTAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATCCGCAATACCGACCAG Xhol 

256 Forward CGCGGATCCGCTAGC-TTTTAACACCGCCGGAC BamHI- 

Nhel 

Reverse CCC GCTCGAG -ACGCCTGTTTGTGCGG Xhol 

257 Forward CG CGGATCCCATATG -GCGGTTTCTTTCCTG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GCGCGTGAATATCGCG Xhol 

258 Forward AAAAAAGAATTC-GATTATTTCTGGTGGATTGTTGCGTTCAG Eco RI 
Reverse AAAAAACTGCA^-CTACGCATAAGTTTTTACCGTTTTTGG PstI 

258a Forward AAAAAAGAATTC-GCGAAGGCGGTGGCGCAAGGCGA Eco RI 

Reverse AAAAA ACTGCAG -CTACGCATAAGTTTTTACCGTTTTTGrT PstI 

259 Forward CGCGGATCCCATATG-GAAGAGCTGCCTCCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCTTTTCCGGCGTTT Xhol 

260 Forward CGC GGATCCCATATG -GGTGCGGGTATGGT BamHI- 

Ndel 

Reverse CCC GCTCGAG -AACAGGGCGACACCCT Xhol 

261 Forward AAAAAAGAATTC-CAAGATACAGCTCGGGCATTCGC EcoRI 
Reverse AAAAAACTGCAG-TCAAACCAACAAGCCTTGGTCACT PstI 

263 Forward CG CGGATCCCATATG -GCACGTTTAACCGTA BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCGTAAGCCTGCAATT xhol 

264 Forward AAAAAAGGTACC-GCCGACGCAGTGGTCAAGGCAGAA Kpnl 
Reverse AAACTGCAG-TCAGCCGGCGGTCAATACCGCCCG PstI 

265 Forward AAAAAAGAATTC-GCGGAGGTCAAGAGAAGGTGTTTG EcoRI 
Reverse AAAAAACTGCAG-TTACGAATACGTCGTCAAAATGGG PstI 

266 Forward AAAGAATTC-CTCATCTTTGCCAACGCCCCCTTC Eco RI 
Reverse AAACTGCAG-CTATTCCCTGTTGCGCGTGTGCCA PstI 

267 Forward AAAGAATTC-TTCTTCCGATTCGATGTTAATCG Eco RI 
Reverse AAACTGCAG-TTAGTAAAAACCTTTCTGCTTGGC PstI 

269 Forward AAAGAATTC-TGCAAACCTTGCGCCACGTGCCC Eco RI 
Reverse AAACTGCAG-TTACGAAGACCGCAACGAAAGGCAGAG PstI 

269a Forward AAjVAAAGAATTC-GACTTTATCCAAAACACGGCTTCGCC Eco RI 
Reverse AAACTGCAG-TTACGAAGACCGCAACGAAAGGCAGAG PstI 

270 Forward AAAGAMTC-GCCGTCAAGCTCGTTTTGTTGCAATG Eco RI 
Reverse AAACTGCAG-TTATTCGGCGGTAAATGCCGTCTG PstI 

271 Forward CGCGGATCCCATATG-CCTGTGTGCAGCTCGAC BamHI- 

Ndel - 

Reverse CCCGCTCGAG-TCCCAGCCCCGTGGAG Xhol 
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272 Forward AA AGAATTC -ATGACCGCAAAGGAAGAACTrTTTrrrr EcoRI 
Reverse AAACTGCAG-TCAGAGCAGTTCCAAATCGGGGCT PstI 

273 Forward AAAGAATTC-ATGAGTCTTCAGGCGGTATTTATATACCC EcoRI 
Reverse AAACTGCAG-TTACGCGTAAGAAAAAACTGC PstI 

274 Forward CGC GGATCCCATATG -ACAGATTTGGTTACGGAC BamHI- 

Ndel 

Reverse CCCGCTTO^-TTTGCTTTCAGTATTATTGAA Xhol 

276 Forward AAAAA AGAATTC - Eco RI 

ATGATTTTGCCGTCGTCCATCACGATGATGCG 
Reverse AAAAAACTGCAG-CTACACCACCATCGGCGAATTTATGGC PstI 

277 Forward AAAAAAGAAnC-ATGCCGCGCTTTGAGGACAAGCTCGTAGG EcoRI 
Reverse AAAAAACTGCAG-TCATAAGCCATGCTTACCTTCCAACAA PstI 

277a Forward AAAAAAGAATTC-GGGGCGGCGGCTGGGTTGGACGTAGG Eco Rl 
Reverse AAAAAACTGCAG-TCATAAGCCATGCTTACCTTCCAACAA PstI 

278 Forward AAAAAAGGTACC-GTCAAAGTTGTATTAATCGGGCCTTTGCC Kpnl 
Reverse AAAAAACTGCAG-TCATTCAACCATATCAAATCTGCC PstI 

278a Forward AAAAAAGAATTC-AAAACTCTCCTAATTCGTCATAGTCG Eco RI 
Reverse AAAAAACTGCAG-TCATTCAACCATATCAAATCTGCC PstI 

279 Forward CGCGGATCCCATATG-TTGCCTGCAATCACGATT BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTAGAAGCGGGCGGCAA Xhol 

280 Forward AAAAAAGGTACC-GCCCCCCTGCCGGTTGTAACCAG Kpnl 
Reverse AAAAAACTGCAG-TTATTGCTTCATCGCGTTGGTCAAGGC PstI 

281 Forward AAAAAAGAATTC-GCACCCGTCGGCGTATTCCTCGTCATGCG EcoRI 
Reverse AAAAAATCTAGA-GGTCAGAATGCCGCCTTCTTTGCCGAG Xbal 

281a Forward AAAAAAGAATTC-TCCTACCACATCGAAATTCCTTCCGG EcoRI 
Reverse AAAAAATCTAGA-GGTCAGAATGCCGCCTTCTTTGCCGAG Xbal 

282 Forward AAAAAAGAATTC-CTTTACCTTGACCTGACCAACGGGCACAG EcoRI 
Reverse AAAAAACTGCAG-TCAACCTGCCAGTTGCGGGAATATCGT Pst I 

283 Forward CGCGGATCCCATATG-GCCGTCTTTACTTGGAAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-ACGGCAGTATTTGTTTACG Xhol 

284 Forward CGCGGATCCCATATG-TTTGCCTGCAAAAGAATCG BamHI- 

Ndel 

Reverse CCCGCTCGAG-CCGACTTTGCAAAAACTG Xhol 

286 Forward CGCGGATCCCATATG-GCCGACCTTTCCGAAAA BamHI- 

Ndel 

Reverse CCCGCTCGAG-GAAGCGCGTTCCCAAG Xhol 

287 Forward CCGGAATTCTAGCTAGC-CTTTCAGCCTGCGGG EcoRI- 

Nhel 

Reverse CCCG CTCGAG -ATCCTGCTCTTTTTTGCG xhol 

288 Forward CGCGGATCCCATATG-CACACCGGACAGG BamHI- 

Ndel 

Reverse CCCGCTCGAG-CGTATCAAAGACTTGCGT Xhol 
290 Forward CGCGGATCCCATATG-GCGGTTTGGGGrnpTA BamHI- 
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Reverse CCC GCTCGAG -TCGGCGCGGCGGGC 
292 Forward CGCGGATCCCATATG-TGCGGGCAAACGCCC 

Reverse CCC GCTCGAG -TTGATTTTTGCGGATGATTT 

294 Forward AAAAAAGAATTC-GTCTGGTCGATTCGGGTTGTCAGAAC 
Reverse AAAAAACTGCAG-TTACCAGCTGATATAAAACATCGCTTT 

295 Forward CG CGGATCCCATATG -AACCGGCCGGCCTCC: 

Reverse CCCGCTCGAG-CGATATTTGATTCCGTTGC 

297 Forward AAAAAAGAATTC-GCATACATTGCTTCGACAGAGAG 
Reverse AAAAAACTGCAG-TCAATCCGATTGCGACACGGT 

298 Forward AAAAAAGAATTC-CTGATTGCCGTGTGGTTCAGCCAAAACCC 
Reverse AAAAAACTGCAG-TCATGGCTGTGTACTTGATGGTTGCGT 

299 Forward CGCGGATCCGCTAGC-CTACCTGTCGCCTCCG 

Reverse CCC GCTCGAG -TTGCCTGATTGCAGCGG 
302 Forward AAAAAAGAATTC-ATGAGTCAAACCGATACGCAACG 

Reverse AAAAAACTGCAG-TTAAGGTGCGGGATAGAATGTGGGCGC 

305 Forward AAAAAAGGTACC-GAATTTTTACCGATTTCCAGCACCGGA 
Reverse AAAAAACTGCAG-TCATTCCCAACTTATCCAGCCTGACAG 

305a Forward AAAAAAGGTACC-TCCCGTTCGGGCAGTACGATTATGGG 
Reverse AAAAAACTGCAG-TTACAAACCGACATCATGCAGGGTGAA 

306 Forward CGCGGATCCCAIATG-TTTATGAACAAATTTTCCC 

Reverse CCC GCTCGAG -CCGCATCGGCAGAC 
308 Forward CG CGGATCCCATATG -TTAAATCGGGTATTTTATr 



311 



312 



313 



401 



Reverse 
Forward 
Reverse 
Forward 
Reverse 
Forward 
Reverse 
Forward 



CCCGCTCGAG-ATCCGCCATTCCCTGC 

AAAAAAGGTACC-ATGTTCAGTTTTGGCTGGGTGTTT 

AAACTGCAG-ATGTTCATATTCCCTGCCTTCGGC 

AAAAAAGGTACC-ATGAGTATCCCATCCGGCGAAATT 

AAACTGCAG-TCAGTTTTTCATCGATTGAACCGG 

AAAAAAGAATTC-ATGGACGACCCGCGCACCTACGGATC 

AAAAAACTGCAG-TCAGCGGCTGCCGCCGATTTTGCT 

CGCGGATCCCATATG-AAGGCGGCAACACAGC 



Reverse CCCGCTCGAG-CCTTACGTTTTTCAAAGCC 
402 Forward AAAAAAGAATTC-GTGCCTCAGGCATTTTCATTTACCCTTGC 

Reverse AAAAAATCTAGA-TTAAATCCCTCTGCCGTATTTGTATTC 
402a Forward AAAAAAGAATTC-AGGCTGATTGAAAACAAACACGG 

Reverse AAAAAATCTAGA-TTAAATCCCTCTGCCGTATTTGTATTC 
406 Forward CGCGGATCCCATATG-TGCGGGACACTGACAG 

Reverse CCCGCTCGAG-AGGTTGTCCTTGTCTATG 



Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Nhel 

Xhol 

EcoRI 

PstI 

Kpnl 

PstI 

Kpnl 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

Kpnl 

PstI 

Kpnl 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

Xbal 

EcoRI 

Xbal 

BamHI- 

Ndel 

Xhol 
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501 


Forward CGCGGATCCCATATG-GCAGGCGGAGATGGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GGTGTGATGTTCACCC 


Xhol 


502 


Forward 


CGCGGATCCCATATG-GTAGACGCGCTTAAGCA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGCTGCATGGCGGCG 


Xhol 


503 


Forward 


CGCGGATCCCATATG-TGTTCGGGGAAAGGCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CCGCGCATTCCTCGCA 


Xhol 


504 


Forward 


CGCGGATCCCATATG-AGCGATATTGAAGTGACG 


BamHI- 








Ndel 




Reverse 


GCCCAAGCTT-TGATTCAAGTCCTTGCCG 


Hindni 


505 


Forward 


CGCGGATCCCATATG-TTTCGTTTACAATTCAGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CGGCGTTTTATAGCGG 


Xhol 


510 


Forward 


CGCGGATCCCATATG-CCTTCGCGGACAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGCACTGGCAGCG 


Xhol 


512 


Forward 


CGCGGATCCCATATG-GGACATGAAGTAACGGT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGGAATAGCCTTTGACG 


Xhol 


515 


Forward 


CGCGGATCCCATATG-GAGGAAATAGCCTTCGA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAATGCCGCAAAGCATC 


Xhol 


516 


Forward 


CGCGGATCCCATATG-TGTACGTTGATGTTGTGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGCGGGCGGCATC 


Xhol 


517 


Forward 


CGCGGATCCCATATG-GGTAAAGGTGTGGAAATA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GTGCGCCCAGCCGT 


Xhol 


518 


Forward 


AAAGAATTC-GCTTTTTTACTGCTCCGACCGGAAGG 


EcoRI 




Reverse 


AAACTGCAG-TCAAATTTCAGACTCTGCCAC 


PstI 


519 


Forward 


CGCGGATCCCATATG-TTCAAATCCTTTGTCGTrA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGGCGGTTTTGCTGC 


Xhol 


520 


Forward 


CGCGGATCCCATATG-CCTGCGCTTCTTTCA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATATTTACATTTCAGTCGGC 


Xhol 




Forward 


CGCGGATCCCATATG-GCCAAAATCTATACCTfiT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CATACGCCCCAGTTCC 


Xhol 


522 


Forward 


CGCGGATCCCATATG-ACTGAGCCGAAACAC 


BamHI- 




Reverse 


GCCCAAGCTT-TTCTGATTTCAAATCGGCA 


Ndel 
Hindin 


523 


Forward 


CGCGGATCCCATATG-GCTCTGCTTTrrrTTG 


BamHI- 








Ndel 
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Reverse CCCGCTCGAG-AGGGTGTGTGATAATAAGAAG — 

525 Forward CG CGGATCCCATATG -GCCGAAATGGTTn A A Axr BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCCCGTGCATATCATAAA Xhol 
527 Fomard AAAGAATTC-TTCCCTCAATGTTGCCGTTTTCG EcoRI 
Reverse AAACTGCAG-TTATGCTAAACTCGAAACAAATTC PstI 

529 Forward CGCGGATCCGCTAGC-TGCTCCGGCAGCAAAAC BamHI- 

Nhel 

Reverse GCCCAAGCTT-ACGCAGTTCGGAATGGAG HindlH 

530 Forward CGCGGATCCCATATG-AGTGCGAGCGCGG BamHI- 

Ndel 

Reverse CCCGCTCGAG-ACGACCGACTGATTCCG Xhol 

531 Forward AAAAAAGAATTC-TATGCCGCCGCCTACCAAATCTACGG EcoRI 
Reverse AAAAAACTGCAG-TTAAAACAGCGCCGTGCCGACGACAAG PstI 

532 Forward AAAAAAGAATTC-ATGAGCGGTCAGTTGGGCAAAGGTGC EcoRI 
Reverse AAAAAACTGCAG-TCAGTGTTCCAAGTGGTCGGTATCAAA Pst I 

532a Forward AAAAAAGAATTC-TTGGGTGTCGCGTTTGAGCCGGAAGT EcoRI 
Reverse AAAAAACTGCAG-TCAGTGTTCCAAGTGGTCGGTATCAAA PstI 

535 Forward AAAGAATTC-ATGCCCTTTCCCGTTTTCAGAC Eco RI 

Reverse AAACTGCAG-TCAGACGACCCCGCCTTCCCC PstI 

537 Forward CGC GGATCCCATATG -CATACCCAAAArrAATrr BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATCCTGCAAATAAAGGGTT xhol 

538 Forward CG CGGATCCCATATG -GTCGAGCTGGTn A A ACr BamHI- 

Ndel 

Reverse CCCGCTCGAG-TGGCATTTCGGTTTCGTC Xhol 

539 Forward CGC GGATCCGCTAGC -GAGGATTTGC AGGA A A BamHI- 

Nhel 

Reverse CCCGCTCGAG-TACCAATGTCGGCAAATC Xhol 

542 Forward AAAGAATTC-ATGCCGTCTGAAACCGTGTC EcoRI 
Reverse AAACTGCAG-TTACCGCGAACCGGTCAGGAT PstI 

543 Forward AAAAAAGAATTC-GCCTTCGATGGCGACGTTGTAGGTAC Eco RI 
Reverse AAAAA ATCTAGA - Xbal 

TTAATGAAGAAGAACATATTGGAATTTTGG 
543a Forward AAAAAAGAATTC-GGCAAAACTCGTCATGAATTTGC ' Eco RI 

Reverse AAAAA ATCTAGA - Xbal 
TTAATGAAGAAGAACATATTGGAATTTTGG 

544 Forward AAAGAATJC-GCGCCCGCCTTCTCCCTGCCCGACCTGCACGG EcoRI 
Reverse AAACTGCAG-CTATTGCGCCACGCGCGTATCGAT PstI 

544a Forward AAAAA AGAATTC - Eco RI 

GCAAATGACTATAAAAACAAAAACTTCCAAGTACTTGC 

Reverse AAACTGCAG-CTATTGCGCCACGCGCGTATCGAT Pst I 

547 Forward AAAGAATTC-ATGTTCGTAGATAACGGATTTAATAAAAC EcoRI 
Reverse AAACTGCAG-TTAACAACAAAAAACAAACCGCTT PstI 

548 Forward AAAGAATTC-GCCTGCAAACCTCAAGACAACAGTGCGGC EcoRI 
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Reverse AAA CTGCAG -TCAGAGCAGGGTCCTTACATrrTTTr 
550 Forward AAAAAA GTCGAC - 

ATGATAACGGACAGGTTTCATCTCTTTCATTTTCC 

Reverse AAACTGCAG-TTACGCAAACGCTGCAAAATCCCC 
550a Forward AAAAAAGAATTC-GTAAATCACGCCTTTGGAGTCGCAAACGG 

Reverse AAACTGCAG-TTACGCAAACGCTGCAAAATCCCC 
552 Forward AAAAAAGAATTC-TTGGCGCGTTGGCTGGATAC 

Reverse AAACTGCAG-TTATTTCTGATGCCTTTTCCCAAC 
554 Forward CGCGGATCCCATATG-TCGCCCGCGCCCAAC 

Reverse CCCGCTCGAG-CTGCCCTGTCAGACAC 

556 Forward AAAGAAITC-GCGGGCGGTTTTGTTTGGACATCCCG 
Reverse AAACTGCAG-TTAACGGTGCGGACGTTTCTGACC 

557 Forward CGCGGATCCCATATG-TGCGGTTTCCACCTGAA 



558 



Reverse 
Forward 
Reverse 
558a Forward 
Reverse 
560 Forward 



561 



Reverse 
Forward 



Reverse 
562 Forward 

Reverse 

565 Forward 

Reverse 

566 Forward 

Reverse 

567 Forward 

Reverse 

568 Forward 

Reverse 

569 Forward 
Reverse 

570 Forward 



CCC GCTCGAG -TTCCGCCTTCAGAAAGG 

AAAGAAnC-GAGCTTTATATGTTTCAACAGGGGACGGC 

AAACTGCAG-CTAAACAATGCCGTCTGAAAGTGGAGA 

AAAAAAGAATTC-ATTAGATTCTATCGCCATAAACAGACGGG 

AAAAAACTGCAG-CTAAACAATGCCGTCTGAAAGTGGAGA 

AAAAA AGAATTC - 

TCGCCTTTCCGGGACGGGGCGCACAAGATGGC 

AAAAAACTGCAG-TCATGCGGTTTCAGACGGCATTTTGGC 

CC GGAATTC TACATATG-ATACTGCCAGnrrnT 

CCCGCTCGAG-TTTCAAGCTTTCTTCAGATG 
CGCGGATCCCATATG-GCAAGCCCGTCGAG 

CCCGCTCGAG-AGACCAACTCCAACTCGT 
CGCGGATCCCATATG-AAGTCGAGCGCGAAATAC 

CCCGCTCGAG-GGCATTGATCGGCGGC 
CGCGGATCCCATATG-GTCGGTGGCGAAGAGG 

CCCGCTCGAG-CGCATGGGCGAAGTCA 
CC GGAATTC T ACATATG -AGTGCGAACATCCTTG 

CCCGCTCGAG-TTTCCCCGACACCCTCG 
CGCGGATCCCATATG-CTCAGGGTCAGACC 

CCCGCTCGAG-CGGCGCGGCGTTCAG 
AAAAAAGAiVrTC-CTGATTGCCTTGTGGGAATATGCCCG 
AAAAAACTGCAG-TTATGCATAGACGCTGATAACGGCAAT 
CGCGGATCCCATATG-GACACCTTCCAAAAAATCG 



Reverse CCCGCTCGAG-GCGGGCGTTCATTTrTTT 



PstI 
Sail 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

BcoRI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 
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571 Forward AAAAA AGAATTC - ggoRj 

ATGGGTATTGCCGGCGCCGTAAATGTTTTGAACCC 
Reverse AAAAAACTGCAG-TTATGGCCGACGCGCGGCTACCTGACG PstI 

572 Forward CGCGGATCCCATATG-GCGCAAAAAGGCAAAACC BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCGCAGTGTGCCGATA Xhol 

573 Forward CGCGGATCCCATATG-CCCTGTTTGTGCCG BamHI- 

Ndel 

Reverse CCCGCTCGAG-GACGGTGTCATTTCGCC Xhol 

574 Forward CG CGGATCCCATATG -TGGTTTGCCnrrrnr BamHI- 

Ndel 

Reverse CCCGCTCGAG-AACTTCGATTTTATTCGGG Xhol 

575 Forward CGCGGATCCCATATG-GTTTCGGGCGAGG BamHI- 

Ndel 

Reverse CCCGCTCGAG-CATTCCGAATCTGAACAG Xhol 

576 Forward CGCGGATCCCiVrATG-GCCGCCCCCGCATCT BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATTTACTTTTTTGATGTCGAC Xhol 

577 Forward CGCGGATCCCATATG-GAAAGGAACGGTGTATTT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGCTGTTTGGTAGATTCG xhol 

578 Forward CGCGGATCCCATATG-AGAAGGTTCGTACAG Bamffl- 

Ndel 

Reverse CCCGCTCGAG-GCCAACGCCTCCACG Xhol 

579 Forward CGCGGATCCQ\TATG-AGATTGGGCGTTTCCAC BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGAATTGATGATGTGTATGT Xhol 

580 Forward CGCGGATCCCATATG-AGGCAGACTTCGCCGA BamHI- 

Ndel 

Reverse CCCGCTCGAG-CACTTCCCCCGAAGTG Xhol 

581 Forward CGCGGATCCCATATG-CACTTCGCCCAGC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CGCCGTTTGGCTTTGG Xhol 

582 Forward AAAAAAGAATTC-TTTGGAGAGACCGCGCTGCAATGCGC EcoRI 
Reverse AAAAAATCTAGA-TCAGATGCCGTCCCAGTCGTTGAA Xbal 

583 Forward AAAAAAGAATTC-ACTGCCGGCAATCGACTGCATAATCG EcoRI 
Reverse AAAAAACTGCAG-TTAACGGAGGTCAATATGATGAAATTG PstI 

584 Forward AAAAA AGAATTC - gco 

GCGGCTGAAGCATTGAATTACAATATTGTC 

Reverse AAAAAACTGCAG-TCAGAACTGAACCGTCCCATTGACGCT PstI 

585 Forward AAAAAAGGTACC-TCTTTCTGGCTGGTGCAGAACACCCTTGC EcoRI 
Reverse AAAAAACTGCAG-TCAGTTCGCACTTTTTTCTGTTTTGGA PstI 

586 Forward CGCGGATCCCATATG-GCAGCCCATCTCG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTCAGCGAATCAAGTTTC Xhol 

587 Forward CGCGGATCCCATATG-GACCTGCrrTTnArnA BaraHI- 
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588 



Reverse 
Forward 
Reverse 
589 Forward 



590 



Reverse 
Forward 
Reverse 
591 Forward 
Reverse 
592a Forward 
Reverse 
593 Forward 

Reverse 
594a Forward 
Reverse 

595 Forward 
Reverse 

596 Forward 

Reverse 

597 Forward 

Reverse 
601 Forward 



602 



Reverse 
Forward 



603 



Reverse 
Forward 
Reverse 
604 Forward 



Reverse 
606a Forward 
Reverse 

607 Forward 
Reverse 

608 Forward 
Reverse 

609 Forward 



CCCGCTCGAG-AAATGTATGCTGTACGCC 
AAAAAAGAATTC-GCCGTCCTGACTTCCTATCAAGAACCAGG 
AAAAAACTGCAG-TTATTTGTTTTTGGGCAGTTTCACTTC 
AAAAA AGAATTC - 

ATGCAACAAAAAATCCGTTTCCAAATCGAAGG 

AAAAAACTGCAG-CTAATCGATTTTTACCCGTTTCAGGCG 

AAAAAAGAATTC-ATGAAAAAACCTTTGATTTCAGTTGCGGC 

AAAAAACTGCAG-TTACTGCTGCGGCTCTGAAACCAT 

AAAAAAGAAnC-CACTACATCGTTGCCAGATTGTGCGG 

AAAAAACTGCAG-CTAACeGAGCAGCCGGGTAACGTCGTT 

AAAAAAGAATTC-CGCGATTACACCGCCAAGCTGAAAATGGG 

AAAAAACTGCAG-TTACCAAACGTCGGATTTGATACG 

CGCGGATCCGCTAGC-CTTGAACTGAACGGACTC 

CCC GCTCGAG -GCGGAAGCGGACGATT - 

AAAAAAGAATTC-GGTAAGTTCGCCGTTCAGGCCTTTCA 

AAAAAACTGCAG-TTACGCCGCCGTTTCCTGACACTCGCG 

AAAAAAGAATTC-TGCCAGCCGCCGGAGGCGGAGAAAGG 

AAAAAACTGCAG-TTATTTCAAGCCGAGTATGCCGCG 

CGCGGATCCCATATG-TCCCAACAATACGTC 

CCC GCTCGAG -ACGCGTTACCGGTTTGT 
CGCGGATCCCATATG-CTGCTTCATGTCAGC 

GCCCAAGCTT-ACGTATCCAGCTCGAAG 
CGC GGATCCCATATG -ATATGTTCCCAACCGGCAAT 

CCCG CTCGAG -AAAACAATCCTCAGGCAC 
CGCGGATCCGCTAGC-TTGCTCCATCAATGC 

CCCG CTCGAG -ATGCAGCTGCTAAAAGCG 
AAAAAAGAATTC-CTGTCCTCGCGTAGGCGGGGACGGGG 
AAAAAACTGCAG-CTACAAGATGCCGGCAAGTTCGGC 
CGCGGATCCGCTAGC-CCCGAAGCGCACTT 

CCCG CTCGAG -GACGGCATCTGCACGG 

AAAAAAGAATTC-CGCGAATACCGCGCCGATGCGGGCGC 

AAAAAACTGCAG-TTAAAGCGATTTGAGGCGGGCGATACG 

AAAAAAGAATTC-ATGCTGCTCGACCTCAACCGCTTTTC 

AAAAAACTGCAG-TCAGACGGCCTTATGCGATCTGAC 

AAAAAAGAATTC-ATGTCCGCCCTCCTCCCCATCATCAACCG 

AAAAAACTGCAG-TTAGTCTATCCAAATGTCGCGTTC 

CGCGGATCCCATATG-GTTGTGGATAGACTCG 



Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Nhel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Hindni 

BamHI- 

Ndel 

Xhol 

BamHI- 

Nhel 

Xhol 

EcoRI 

PstI 

BamHI- 

Nhel 

Xhol 

EcoRI 

PstI. 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 
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Reverse CCC GCTCGAG -CTGGATTATGATGTCTGTC Xh^f — 

610 Forward CGC GGATCCCATATG -ATTGGAGGGCTTATGCA BamHI- 

Ndel 

Reverse CCCGCTCGAG-ACGCTTCAACATCTTTGCC Xhol 

611 Forward CGCGGATCCCATATG-CCGTCTCAAAACGGG BamHI- 

Ndel 

Reverse CCC GCTCGAG -AACGACTTTGAACGCGCAA xhol 

613 Forward CGCGGATCCCATATG-TCGCGTTCGAGCCG3 BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGCCTGTAAAATAAGCGGC Xhol 

614 Forward CG CGGATCCCATATG -TCCGTCGTGAGCGGC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CCATACTGCGGCGTTC Xhol 
616 Forward AAAAAAGAATTC-ATGTCAAACACAATCAAAATGGTTGTCGG Eco RI 

Reverse AAAAAATCTAGA-TTAGTCCGGGCGGCAGGCAGCTCG Xbal 
619a Forward AAAAAAGAATTC-GGGCTTCTCGCCGCCTCGCTTGC Eco RI 

Reverse AAAAAACTGCAGTCATTTTTTGTGTTTTAAAACGAGATA PstI 
622 Forward CGC GGATCCCATATG -GCCGCCCTGCCTAAAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTGTCCAAATGATAAATCTG Xhol 

624 Forward CGCGGATCCCATATG-TCCCCGCGCTTTTACCG BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGATTCGGGCCTGCGC Xhol 

625 Forward CGC GGATCCCATATG -TTTGCAACCAGGA A A ATO BamHI- 

Ndel 

Reverse CCCGCTCGAG-CGGCAAAATTACCGCCTT Xhol 
627a Forward AAAAAAGAATTC-AAAGCAGGCGAGGCAGGCGCGCTGGG Eco RI 
Reverse AAAAA ACTGCAG - Psti 
TTACGAATGAAACAGGGTACCCGTCATCAAGGC 
628 Forward AAAAAAGGTACC-GCCTTACAAACATGGATTTTGCGTTC Kpnl 
Reverse AAAAAACTGCAG-CTACGCACCTGAAGCGCTGGCAAA Pst I 

629a Forward AAAAAAGAATTC-GCCACCTTTATCGCGTATGAAAACGA Eco RI 

Reverse AAAAAACTGCAG-TTACAACACCGCCGTCCGGTTCAAACC Pst I 
630a Forward AAAAAAGAATTC-GCGGCTTTGGGTATTTCTTTCGG Eco RI 

Reverse AAAAAACTGCAG-TTAGGAGACTTCGCCAATGGAGCCGGG PstI 
635 Forward AAAAA AGAATTC - Eco RI 

ATGACCCAGCGACGGGTCGGCAAGCAAAACCG 
Reverse AAAAAACTGCAG-TTAATCCACTATAATCCTGTTGCT Pst I 

638 Forward AAAAAAGAATTC-ATGATTGGCGAAAAGTTTATCGTAGTTGG EcoRI 
Reverse AAAAAACTGCAG-TCACGAACCGATTATGCTGATCGG Pst I 

639 Forward CGCGGATCCCATATG-ATGCTTTATTTTGTTCG BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATCGCGGCTGCCGAC Xhol 
642 Forward CGCGGATCCCATATG-CGGTATCCGCCGCAAT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGATTGCGGGGCATTA Xhol 
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643 Forward CGC GGATCCCATATG -GCTTCGCCGTCGGCAG BamHI- 

Ndel 

Reverse CCC GCTCGAG -AACCGAAAAACAGACCGC Xhol 

644 Forward AAAAA AGAATTC - Eco RI 

ATGCCGTCTGAAAGGTCGGCGGATTGTTGCCC 

Reverse AAAAAATCTAGA-CTACCCGCAATATCGGCAGTCCAATAT PstI 

645 Forward AAAAAAGAATTC-GTGGAACAGAGCAACACGTTAAATCG Eco RI 
Reverse AAAAAACTGCAG-CTACGAGGAAACCGAAGACCAGGCCGC PstI 

647 Forward AAAAAAGAATTC-ATGCAAAGGCTCGCCGCAGACGG Eco RI 
Reverse AAAAAACTGCAG-TTAGATTATCAGGGATATCCGGTAGAA PstI 

648 Forward AAAAA AGAATTC - Eco RI 

ATGAACAGGCGCGACGCGCGGATCGAACG 

Reverse AAAAAACTGCAG-TCAAGCTGTGTGCTGATTGAATGCGAC Pst I 

649 Forward AAAAAAGAATTC-GGTACGTCAGAACCCGCCCACCG Eco RI 
Reverse AAAAAACTGCAG-TTAACGGCGGAAACTGCCGCCGTC Pst I 

650 Forward AAAAAAGAATTC-ATGTCCAAACTCAAAACCATCGC Eco RI 
Reverse AAAAAACTGCAG-TCAGACGGCATGGCGGTCTGTTTT PstI 

652 Forward AAAAA AGGTACC - Kpn I 

GCTGCCGAAGACTCAGGCCTGCCGCTTTACCG 

Reverse AAAAAACTGCAG-TTATTTGCCCAGTTGGTAGAATGCGGC PstI 

653 Forward AAAAAAGAAnC-GCGGCTTTGCCGGTAATTTTCATCGG Eco RI 
Reverse AAAAAACTGCAG-CTATGCCGGTCTGGTTGCCGGCGGCGA Pst I 

656a Forward AAAAAAGAATTC-CGGCCGACGTCGTTGCGTCCTAAGTC Eco RI 

Reverse AAAAAACTGCAG-CTACGATTTCGGCGATTTCCACATCGT PstI 

657 Forward AAAAAAGAATTC-GCAGAATTTGCCGACCGCCATTTGTGCGC Eco RI 
Reverse AAAAAACTGCAG-TTATAGGGACTGATGCAGTTTTTTTGC Pst I 

658 Forward CGCGGATCCCATATG-GTGTCCGGAATTGTG BamHI- 

Ndel 

Reverse CCCGCTCGAG-GGCAGAATGTTTACCGTT Xhol 

661 Forward AAAAA AGAATTC - Eco RI 
ATGCACATCGGCGGCTATTTTATCGACAACCC 

Reverse AAAAAACTGCAG-TCACGACGTGTCTGTTCGCCGTCGGGC Pst I 

663 Forward CGCGGATCCCATATG-TGTATCGAGATGAAATT BamHI- 

Ndel 

Reverse CCCGCTCGAG-GTAAAAATCGGGGCTGC Xhol 

664 Forward CGCGGATCCCATATG-GCGGCTGGCGCGGT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AAATCGAGTTTTACACCAC Xhol 

665 Forward AAAAAAGAATTC-ATGAAATGGGACGAAACGCGCTTCGG Eco RI 
Reverse AAAAAACTGCAG-TCAATCCAAAATTTTGCCGACGATTTC Pst I 

666 Forward AAAAAAGAATTC-AACTCAGGCGAAGGAGTGCTTGTGGC EcoRI 
Reverse AAAAAATCTAGA-TCAGTTTAGGGATAGCAGGCGTAC Xbal 

667 Forward AAAAA AGAATTC - Eco lU 

CCGCATCCGTTTGATTTCCATTTCGTATTCGTCCG 

Reverse AAAAAACTGCAG-TTAATGACACAATAnornTA Ar.rr pct t 
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669 Forward AAAAAAGAATTC-ATGCGCCGCATCATTAAAAAACACCAGCC Eco RI 

I^GGC Pst I 

GTCTTC Eco RI 
Pst I 





Reverse 


670a Forward 




Reverse 


ATI 


Forward 




Reverse 


672 


Forward 






673 


Forward 




Reverse 


674 


Forward 




Reverse 


u/D 


Forward 




Reverse 




Forward 




Reverse 


680 


Forward 




Reverse 


681 


Forward 




Reverse 


683 


Forward 




Reverse 


684 


Forward 




Reverse 


685 


Forward 




Reverse 


VJOVJ 


r \Ji W dl u 




Reverse 


687 


Forward 




Reverse 


690 


Forward 




Reverse 


691 


Forward 




Reverse 



TTAGGAGCTTTTGGAACGCGTCGGACTGGC 

CG CGGATCCCATATG -ACCAGCAGGGTAAC BamHI- 

Ndel 

CCC GCTCGAG -AGCAACTATAAAAACGCAAG Xhol 
CGCGGATCCCATATG-AGGAAAATCCGCACC BaraHI- 

Ndel 

CCC GCTCGAG -ACGGGATAGGCGGTTG xhol 
AAAAAAGAATTC-ATGGATATTGAAACCTTCCTTGCAGG Eco RI 
AAAAAACTGCAG-CTACAAACCCAGCTCGCGCAGGAA Pst I 

AAAAAAGAATTC-ATGAAAACAGCCCGCCGCCGTTCCCG Eco RI 
AAAAAACTGCAG-TCAACGGCGTTTGGGCTCGTCGGG Pst I 

CG CGGATCCCATATG -AACACCATCGCCCC BamHI- 

Ndel 

CCC GCTCGAG -TTCTTCGTCTTCAAACTGT xhol 
AAAAAAGAATTC-AGACGGCATTCCCGATCAGTCGATTTTGA Eco RI 
AAAAAACTGCAG-TTACGTATGCGCGAAATCGACCGCCGC Pst I 
CGCGGATCCGCTAGC-ACGAAGGGCAGTTCGG BamHI- 

Nhel 

CCC GCTCGAG- CATCAAAAACCTGCCGC xhol 
AAAAAAGAATTC-ATGACGACGCCGATGGCAATCAGTGC EcoRI 
AAAAAACTGCAG-TTACCGTCTTCCGCAAAAAACAGC Pst I 

CGC GGATCCCATATG -TGCAGCACACCGGACAA BamHI- 

Ndel 

CCC GCTCGAG -GAGTTTTTTTCCGCATACG xhol 
CG CGGATCCCATATG -TGCGGTACTGTGCAAAG BamHI- 

Ndel 

CCCGCTCGAG-CTCGACCATCTGTTGCG Xhol 
CGC GGATCCCATATG -TGTTTGCTTAATA ATA A APATT BamHI- 

Ndel 

CCC GCTCGAG -CTTTTTCCCCGCCGCA xhol 
CGC GGATCCCATATG -TGCGGCGGTTCGGAAG BamHI- 

Ndel 

CCC GCTCGAG -CATTCCGATTCTGATGAAG xhol 
CGCGGATCCCATATG-TGCGACAGCAAAGTCCA BamHI- 

Ndel 

CCC GCTCGAG- CTGCGCGGCTTTTTGTT xhol 
CGCGGATCCCATATG-TGTTCTCCGAGCAAAGAC BamHI- 

Ndel 

CCC GCTCGAG -TATTCGCCCCGTGTTTGG xhol 
CGCGGATCCCATATG-GCCACGGCTTATATCCC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTGAGGCAGGAAGAAAG Xhol 
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694 Forward CGC GGATCCCATATG -TTGGTTTCCGCATCCGG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TCTGCGTCGGTGCGGT Xhol 

695 Forward CGCGGATCCCATATG-TTGCCTCAAACTCGTCCG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TCGTTTGCGCACGGCT Xhol 

696 Forward CGC GGATCCCATATG -TTGGGTTGCCGGCAnrT BamHI- 

Ndel 

Reverse CCC GCTCGAG -TTGATTGCCGCAATGATG xhol 
700a Forward AAAAAAGAATTC-GCATCGACAGACGGTGTGTCGTGGAC Eco RI 
Reverse AAAAAACTGCAG-TTACGCTACCGGCACGACTTCCAAACC PstI 

701 Forward CGC GGATCCCATATG -AAGACTTGTTTGGATArTTr BamHI- 

Ndel 

Reverse CCCGCTCGAG-TGCCGACAACAGCCTC Xhol 

702 Forward AAAAAAGAATTC-ATGCCGTGTTCCAAAGCCAGTTGGATTTC EcoRI 
Reverse AAAAAACTGCAG-TTAACCCCATTCCACCCGGAGAACCGA PstI 

703 Forward CGCGGATCCGCTAGC-CAAACGCTGGCAACCG BamHI- 

Nhel 

Reverse CCCGCTCGAG-TTTTGCAGGTTTGATGTTTG Xhol 
704a Forward AAAAAAGAATTC-GCTTCTACCGGTACGCTGGCGCG Eco RI 

Reverse AAAAA ACTGCAG - Pstj 
TTAGTTTTGCCGGATAATATGGCGGGTGCG 

707 Forward CGCGGATCCGCTAGC-GAAATTATTAACGATGCAGA BamHI- 

Nhel 

Reverse CCCGCTCGAG-GAAACTGTAATTCAAGTTGA Xhol 

708 Forward CGC GGATCCGCTAGC -CCTTTTAAGCCATrrA AAA BamHI- 

Nhel 

Reverse CCC GCTCGAG -TTGACCGGTGAGGACG xhol 
710 Forward CGC GGATCCCATATG -GAAACCCACGA A A A A ATP BamHI- 

Ndel 

Reverse CCCGCTCGAG-AACGGTTTCGGTCAG Xhol 
714 Forward CGC GGATCCCATATG -AGCTATCAAGACATrTT BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCGGTAGGTAAATCGGAT Xhol 
716 Forward CGCGGATCCCATWTG-GCCAACAAACCGGCAAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTAGAACCGCATTTGCC Xhol 
718 Forward CGC GGATCCCATATG -GAGCCGATAATGGP A A A BamHI- 

Ndel 

Reverse CCCGCTCGAG-GGCGCGGGCATGGTCTTGTCC Xhol 
720 Forward CGC GGATCCCATATG -AGCGGATGGG AT A rr BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTTGCATAGCTGTTGACCA Xhol 
723 Forward CGCGGATCCCATATG-CGACCCAAGCCCC BamHI- 

Ndel 

Reverse CCCGCTCGAG-AATGCGAATCCGCCGCC xhol 
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725 


Forward CGCGGATCCCATATG-GTGCGCACGGTTAAA 


BamHI- 




Reverse CCCGCTCGAG-TTGCTTATCCTTAAGGGTTA 


Ndel 
Xhol 


726 


Forward CGCGGATCCCATATG-ACCATCTATTTCAA A A Ar 


BamHI- 




Reverse CCCGCTCGAG-GCCGATGTTTAGCGTCC 


Ndel 
Xhol 


728 


Forward CGCGGATCCCATATG-TTTTGGCTGGGAACnrTG 


BamHI- 




Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC 


Ndel 
Xhol 


729 


Forward CGCGGATCCCATATG-TGCACCATGATTCCrCA 


BamHI- 




Reverse GCCCAAGCTT-TTTGTCGGTTTGGGTATC 


Ndel 
Hindin 


731 


Forward CGCGGATCCGCTAGC-GCCGTGCCGGAGG 


BamHI- 




Reverse CCCGCTCGAG-ACGGGCGCGGCAG 


Nhel 
Xhol 


732 


Forward CCGGAATTCTACATATG-TCGAAACCTGTTTTTAAGA A 


EcoRI- 




Reverse CCCGCTCGAG-CTTCTTATCTTTTTTATCTTTf: 


Ndel 
Xhol 


733 


Forward CGCGGATCCCATATG-GCCTGCGGCGGCAA 


BamHI- 

1 # fclJ 111 A-X. 




Reverse CCCGCTCGAG-TCGCTTGCCTCCTTTAC 


Ndel 
Xhol 


734 


Forward CGCGGATCCCATATG-GCCGATACTTACGGCTAT 


BamHI- 




Reverse CCCGCTCGAG-TTTGAGATTTTGAATCAAAGAG 


Ndel 
Xhol 


735 


Forward CGCGGATCCCATATG-AAGCAGCAGGCGGTnA 


BamHI- 




Reverse CCCGCTCGAG-ATTTCCGTAGCCGAGGG 


Ndel 
Xhol 


737 


Forward CGCGGATCCCATATG-CACCACGACGGACArn 


BamHI- 




Reverse CCCGCTCGAG-GTCGTCGCGGCGGGA 


Ndel 
Xhol 


739 


Forward CGCGGATCCCATATG-GCAAAAAAACCGAACA 


BamHI- 




Reverse CCCGCTCGAG-GAAGAGTTTGTCGAGAATT 


Ndel 
Xhol 


740 


Forward CGCGGATCCCATATG-GCCAATCCGCCCGAAG 


BamHI- 




jvcverse i i^Ltaij-aaacvjCvjUCAAAA 1 AG T G 


Ndel 
Xhol 


741 


Forward CGCGGATCCCATATG-TGCAGCAGCGGAGGG 


BamHI- 






Ndel 
Xhol 


743 


Forward CGCGGATCCCATATG-GACGGTGTTGTGCCTGTT 


BamHI- 




Reverse CCCGCTCGAG-CTTACGGATCAAATTGACG 


Ndel 
Xhol 


745 


Forward CGCGGATCCCATATG-TTTTGGCAACTGArrG 


BamHI- 




Reverse CCCGCTCGAG-CAAATCAGATGCCTTTAGG 


Ndel 
Xhol 


746 


Forward CGCGGATCCCATATG-TCCGAAAACAAACAAAAC 


BamHI- 
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Ndel 




Reverse CCCGCTCGAG-TTCATTCGTTACCTGACC 


Xhol 


747 


Forward CCGGAATTCTAGCTAGC-CTGACCCCTTGGG 


EcoRI- 






Nhel 




Reverse GCCCAAGCTT-TTTTGATTTTAATTGACTATAGAAC 


Hindin 


749 


Forward CGCGGATCCCATATG-TGCCAGCCGCCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTCAAGCCGAGTATGC 


Xhol 


750 


Forward CGCGGATCCCATATG-TGTTCGCCCGAACCTG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CTTTTTCCCCGCCGCAA 


Xhol 


758 


Forward CGCGGATCCCATATG-AACAATCTGACCGTGTT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TGGCTCAATCCTTTCTGC 


Xhol 


759 


Forward CGCGGATCCGCTAGC-CGCTTCACACACACCAr 


BamHT- 






Nhel 




Reverse CCCGCTCGAG-CCAGTTGTAGCCTATTTTG 


Xhol 


763 


Forward CGCGGATCCCATATG-CTGCCTGAAGCATGGCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTCCGCAAATACCGTTTCC 


Xhol 


764 


Forward CGCGGATCCCATATG-TTTTTCTCCGCCCTGA 


BamHI- 






Ndel 




JVC verse i i 1 i^UC 1 AAACjC 1 ITC 


Xhol 


765 


Forward CGCGGATCCCATATG-TTAAGATGCCGTCCG 


BamHI- 






Ndel 




iveverse ^i^i^ov^ i v^LiALj-AL/LrUUlaAUU i 1 1 1 1 1 A 1 tAA 


Xhol 


767 


Forward CGCGGATCCCATATG-CTGACGGAAGGGGAAG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTCTGTACAGCAGGGG 


Xhol 


768 


Forward CGCGGATCCCATATG-GCCCCGCAAAAACGGG 


BamHI- 




Reverse CCCGCTCGAG-TTTCATCCCTTTTTTGAGC 


Ndel 
Xhol 


770 


Forward CGCGGATCCCATATG-TGCGGCAGCGGCGAA 


BamHI- 




Keverse i cua^j-vjCvj 1 1 1 G rCGAGATTTTC 


Ndel 
Xhol 


771 


Forward CGCGGATCCCATATG-TCCGTATATCGGArrTTr 


RamHT- 






Ndel 




Reverse CCCGCTCGAG-CGGTTCTTTAGGTTTGAG 


Xhol 


772 


Forward CGCGGATCCCATATG-TTTGCGGCGTTGGTGG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CAATGCCGACATCAAACG 


Xhol 


774 


Forward CGCGGATCCCATATG-TCCGTTTCACCCGTTrr: 


BamHI- 




Reverse CCCGCTCGAG-TCGTTTGCGCACGGCT 


Ndel 
Xhol 


790 


Forward CGCGGATCCCATATG-GCAAGAAGGTCA A A A Ar 


BamHI- 






Ndel 
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Reverse CCCGCTCGAG-GGCGTTGTTCGGATTTCG 
900 Forward CGCGGATCCCATATG-CCGTCTGAAATGCCG 



901 



Reverse CCCGCTCGAG-ATATGGAAAAGTCTGTTGTC 
Forward CGC GGATCCCATATG -CCCGATTTTTmATrT 



Reverse CCCGCTCGAG-AAAATGGAACAATACCAGG 

902 Forward. CCGGAATTCT ACATATG -TTGCACTTTCAAAGGATAATr 

2 

Reverse CCCGCTCGAG-AAAAATGTACAATGGCGTAC 

903 Forward CCGGAATTCTAGCTAGC-CAGCGTCAGCAGCACAT 

Reverse CCCGCTCGAG-GAAACTGTAATTCAAGTTGAA 

904 Forward AAAAAAGGTACC-ATGATGCAGCACAATCGTTTC 
Reverse AAACTGCAG-TTAATATCGATAGGTTATATG 

904a Forward AAAAAAGAATTC:CGGCTCGGCATTGTGCAGATGTTGCA 
Reverse AAACTGCAG-TTAATATCGATAGGTTATATG 

905 Forward CGCGGATCCCATATG-AACAAAATATACCGCATC 

Reverse CCCGCTCGAG-CCACTGATAACCGACAGAT 

907 Forward CGCGGATCCCATATG-GGCGCGCAACGTGAG 

Reverse CCCGCTCGAG-ACGCCACTGCCAGCG 

908 Forward AAAGAATTC-GCAGAGTTAGTAGGCGTTAATAAAAATAC 
Reverse AAACTGCAG-TTAATATGGTTTTGTCGTTCG 

909 Forward CGCGGATCCCATATG-TGCGCGTGGGAAACTTAT 

CCCGCTCGA^-TCGGTTTTGAAACTTTGGTTTT 
AAAGAATTC-GCATTTGCCGGCGACTCTGCCGAGCG 
AAACTGCAG-TCAGCGATCGAGCTGCTCTTT 
AAAGAATTC-GCTTTCCGCGTGGCCGGCGGTGC 

AAAAAACTGCAG-GTCGACTTATTCGGCGGCTTTTTCCGC 
AAAAA AGAATTC - 

CAAATCCGTCAAAACGCCACTCAAGTATTGAG 
AAAAAACTGCAG-TTACAGTCCGTCCACGCCTTTCGC 
CGCGGATCCCATATG-GAAACCCGCCCCGC 



Reverse 

910 Forward 
Reverse 

911 Forward 
Reverse 

912 Forward 

Reverse 

913 Forward 



Reverse CCCGCTCGAG-AGGTTGTGTTCCAGGTTG 

915 Forward CGCGGATCCCATATG-TGCCGGCAGGCGGAA 

Reverse CCCGCTCGAG-TTTGAAAATATAGGTATCAGG 
914 Forward AAAGAATTC-GACAGAATCGGCGATTTGGAAGCACG 
Reverse AAACTGCAG-CTATATGCGCGGCAGGACGCTCAACGG 

916 Forward CGCGGATCCCATATG-GCAATGATGGCGGCTG 

Reverse CCCGCTCGAG-TTTGGCGGCATCTTTCAT 



Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI- 

Ndel 

Xhol 

EcoRI- 

Nhel 

Xhol 

Kpnl 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 
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917 Forward AAAAAAGAATTC-CCTGCCGAAAAACCGGCACCGGC EcoRI 

Reverse AAAAAACTGCAG-TTATTTCCCCGCCTTCACATCCTG PstI 

919 Forward CGCGGATCCCATATG-TGCCAAAGCAAGAGCATC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CGGGCGGTATTCGGG Xhol 

920 Forward CGCGGATCCCATATG-CACCGCGTCTGGGTC BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATGGTGCGAATGACCGA xhol 

921 Forward AAAAAAGAATTC-TTGACGGAAATCCCCGTGAATCC Eco RI 
Reverse AAAAAACTGCAG-TCATTTCAAGGGCTGCATCTTCAT Pst I 

922 Forward. CGCGGATCCGCTAGC-TGTACGGCGATGGAGGC BamHI- 

2 Nhel 
Reverse CCCGCTCGAG-CAATCCCGGGCCGCC Xhol 

923 Forward CGC GGATCCCATATG -TGTTACGCAATATTnTrrr BamHI- 

Nhel 

Reverse CCCGCTCGAG-GGACAAGGCGACGAAG xhol 

925 Forward CGC GGATCCCATATG -AAACAAATGCTTTTAnrrrx BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCCGTTGCATTTGATTTC Xhol 

926 Forward CGCGGATCCCATATG-TGCGCGCAATTACCTC BamHI- 

Ndel 

Reverse CC CGCTCGAG -TCTCGTGCGCGCCG Xhol 

927 Forward CGCGGATCCCATATG-TGCAGCCCCGCAGC BamHI- 

Ndel 

Reverse CCCGCTCGAG-GTTTTTTGCTGACGTAGT Xhol 
929a Forward AAAAAAGAATTC-CGCGGTTTGCTCAAAACAGGGCTGGG EcoRI 

Reverse AAAAAATCTAGA-TTAAGAAAGACGGAAACTACTGCC Xbal 
931 Forward AAAAAAGAATTC-GCAACCCATGTTTTGATGGAAAC Eco RI 

Reverse AAAAAACTGCAG-TTACTGCCCGACAACAACGCGACG PstI 

935 Forward AAAAA AGAATTC - Eco RI 

GCGGATGCGCCCGCGATTTTGGATGACAAGGC 
Reverse AAAAAACTGCAG-TCAAAACCGCCAATCCGCCGACAC PstI 

936 Forward CG CGGATCCCATATG -GCCGCCGTCGGCGT BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCGTTGGACGTAGTTTTG Xhol 

937 Forward AAAAAAGAATTC-CCGGTTTACATTCAAACCGGCGCAAC EcoRI 
Reverse AAAAAACTGCAG-TTAAAATGTATGCTGTACGCCAAA Pst I 

939a Forward AAAAAAGAATTC-GGTTCGGCAGCTGTGATGAAACC Eco RI 

Reverse AAAAAACTGCAG-TTAACGCAAACCTTGGATAAAGTTGGC PstI 
950 Forward CGCGGATTCCATATG-GCCAACAAACCGGCAAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTAGAACCGCATTTGCC Xhol 
953 Forward CGCGGATCCCATATG-GCCACCTACAAAGTGGAC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTGTTTGGCTGCCTCGAT Xhol 
957 Forward CGCGGATCCCATATG-TTTTGGCTGGGAArnnrt BamHI- 
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Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC 

958 Forward CGCGGATCCCATATG-GCCGATGCCGTTGCG 

Reverse GCCCAAGCTT-GGGTCGTTTGTTGCGTC 

959 Forward CGCGGATCCCATyMTC-CACCACGACGGACACG 

Reverse CCCGCTCGAG-GTCGTCGCGGCGGGA 
961 Forward CGCGGATCCCATATG-GCCACAAGCGACGACG 

Reverse CCCGCTCGAG-CCACTCGTAATTGACGC 

972 Forward AAAAA AGAATTC - 

TTGACTAACAGGGGGGGAGCGAAATTAAAAAC 
Reverse AAAAAATCTAGA-TTAAAAATAATCATAATCTACATTTTG 

973 Forward AAAAAAGAATTC-ATGGACGGCGCACAACCGAAAAC 
Reverse AAAAAACTGCAG-TTACTTCACGCGGGTCGCCATCAGCGT 

982 Forward CGCGGATCCCATWTG-GCAGCAAAAGACGTAC 

Reverse CCCGCTCGAG-CATCATGCCGCCCATCC 

983 Forward CG CGGATCCCATATG -TTAGCTGTTflT A Ar A Ar a r 

Reverse CCCGCTCGAG-GAACCGGTAGCCTACG 

987 Forward CGCGGATCCCATATG-CCCCCACTGGAAGAAC 

Reverse CCCGCTCGAG-TAATAAACCTTCTATGGGC 

988 Forward CGC GGATCCCATATG -TCTTTAAATTTAronr.A a a a Ar. 

Reverse GCCCAAGCn-TGATTTGCCTTTCCGTTTT 

989 Forward CC GGAATTCT ACATATG-GTCCAr.rTrATrrr^.rT^ 

Reverse CCCGCTCGAG-TTTGAATTTGTAGGTGTATTGC 

990 Forward. CGCGGATOCGCTAGC-TTCAGAGCTCAGCTT 
2 

Reverse CCCGCTCGAG-AAACAGCCATTTGAGCGA 

992 Forward CGC GGATCCCATATG -GACGCGCrrGrrra 

Reverse CCCGCTCGAG-CCAAATGCCCAACCATTC 

993 Forward CGCGGATCCCATATG-GCAATGCTGATTGAAATCA 

Reverse CCCGCTCGAG-GAACACATCGCGCCCG 

996 Forward CGCGGATCCCATATG-TGCGGCAGAAAATCCG^ 

Reverse CCCGCTCGAG-TCTAAACCCCTGTTTTCTG 

997 Forward CCGGAATTCTAGCTAGC-CGGCACGCCGACGIT 

Reverse CCCGCTCGAG-GACGGCATCGTTrAnn 



Ndel 

Xhol 

BamHI- 

Ndel 

Hindni 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

Xbal 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Hindin 

EcoRI- 

Ndel 

Xhol 

Bamm- 

Nhel 

Xhol 

Bamm- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI- 

Nhel 

Xhol 



wo 99/57280 




PCTAJS99/09346 



Underlined sequences indicate restriction recognition sites. 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 1>: 

gOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG GTGTCGGCGA ACGAGGTGTC 

51 CGGCAGGGCT TGCGCCCGGA TGGTGCTGGT CATCTGCCAG ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGC TCGGGTACGC ATACTTTACC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTGCG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCGGTTT GTAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCTGAAG CGATGTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCGGCTTCAT CGGGCAGGTG GGACAATACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 2; ORF 001. ng>: 

gOOl.pep 

1 MLPQGKAARR VSANEVSGRA CA RMVLVICO TLPKRDTLNG SGTHTLPVWA 

51 ILPRSLRSKS TIITFSARFF GSVCNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEAMLRKSS GEKHSVHADC PASSGRWDNT A* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 3>: 

mOOX . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGcAssCTT ss. GCTTGGA yGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GATCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCCTCCGCAT CGGGCAGGTG GGACAAGACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 00 1>: 

mOOl .pep 

1 MLPQGKAARR MSANEVCGXL XA WXVLVICQ TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PSASGRWDKT A* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 5>: 

aOOl.seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGCAAGGCT TGGGCTTGGA TGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCTTGTGCAT CGGGCAGGTG GGACAAAACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 6; ORF 001.a>: 

aOOl.pep , 

1 MLPQGKAARR MSANEVCGKA W AWMVLVICO TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PCASGRWDKT A* 

mOOl/aOOl 96.2% identity over a 131 aa overlap 

10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 

OXillllltll MMIIIIIIIII ItlMlllllllllll 

aOOl . pep MLPQGKAARRMSANEVCGKAWAWMVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 



wo 99/57280 




PCT/US99/09346 



10 



20 



30 



40 



50 



60 



70 80 90 100 110 120 

mOOl.pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 

> > I M N I t I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I ,,,, , 

aOOl . pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 

70 80 90 100 110 120 

130 

mOOl . pep PSASGRWDKTAX 
I I I I I I I I I I I 
aOOl .pep PCASGRWDKTAX 
130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K ^gonorrhoeae 

ORF 001 shows 89.3% identity over a 131 aa overlap with a predicted ORF (ORF 001. ng) 
from K gonorrhoeae: 

mOOl/gOOl 



10 20 30 40 50 60 

iTlOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 

' " I " I • " " = ' I I I I I 1 I I [ I I I I I I I I I I I I I I I i I I : i I I I M i I I 

gOOl MLPQGKAARRVSANEVSGRACARMVLVICQTLPKRDTLNGSGTHTLPVWAILPRSLRSKS 
10 20 30 40 50 60 

70 80 90 100 110 120 

mOOl.pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 

l'><>>l ^MDIMIIIIIIIIIIIIIIIllllllll MIMIIIIIIIIIII 

gOOl TIITFSARFFGSVCNSAARRSSCPSPKIGAVPFIGSVLMVPSEAMLRKSSGEKHSVHADC 

70 80 90 100 110 120 

130 

mOOl . pep PSASGRWDKTAX 
I :: I I I I I : I I I 
gOOl PASSGRWDNTAX 
130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 7>: 

g003.8eq 

1 ATGGTCGTAT TCGTGGCTGA AGGCGTATTC GGTCGCGCTG TTTTGGGTCA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGCTTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAGG TAGTCCTGAT GGTATTCCTC GGCGTCGTAG 

251 AAGTTTTTCA GCGGTTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTGC CGGTGTCGTT ACCCTGTTTG 

401 TTGAGGCTGG TCGGATCAAC GACGCGGAAA TAATATTGCA GGATGTCGTC 

451 CAGgCTGagt TTGTCGGCAT CGTaggtcac tTTGACGGTC TCGGCATGAC 

501 CCGTATGGCG GTaggacact tctTCgtanc TcGGGtTTTC CGTGttGCCG 

551 TTGGCgttac cGGATACCGC gtcaACCACG CCGTcgatgc gttggaAATa 

601 ggCTTCCAAg ccccaaaagc agccgccggc gaagtaaatg gtgcccgtot 

651 tcatgattGC TGa 



This corresponds to the amino acid sequence <SEQ ID 8; ORF 003.ng>: 

g003.pep 

1 MWFVAEGVF GRAVLGHLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGFG 

51 FARQRFVGFA DVDV AVAVGV FNOWLMVFL GWEVFQRFV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVG WH AAAVLRAGW TLFV EAGRTN DAEIILQDW 
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151 QAEFVGIVGH FDGLGMTRMA VGHFFVRVFR VAVGVTGYRV NHAVDALEIG 
201 FQAPKAAAGE VNGARVHDC 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 9>: 

aOOS.seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGsTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGGG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CAGCkTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTtCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGk CGGCGATGAC GGCTTTTTCG kCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC 

4 51 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCg.CCGGC GAGGTAAATG GTGCGCGTGT 

651 TCATGATTTT TGA 



This corresponds to the amino acid sequence <SEQ ID 10; ORF 003>: 

mOOa.pop Length: 221 

1 MWFVAEGIF GRAVLGNLXL LFGQGAFEFG VTRFFIRCRV EAFALRGGLG 

51 FARQRFVSXA DVDV AVAVGV FNOWLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGXGDD GFFXGVG VVH AAAVLRTGW ALFV EAGRIN DAEEILQDVV 

151 *AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

201 GFQAPEAAXG EVNGARVHDF * 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 1 1>: 

a003.seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATATCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTTCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

401 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC 

451 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCCGCCGGC GAGGTAGATG GTGCGCGTGT 

651 TCATGATTTT TGA 



This corresponds to the amino acid sequence <SEQ ID 12; ORF 003.a>: 

aOOS.pep 

1 MWFVAEGIF GRAVLGNLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGLG 

51 FARQRFVGFA DI DVAVAVGV FNOWLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVG WH AAAVLRTGW ALFV EAGRT^3 DAEEILQDW 

151 *AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

201 GFQAPEAAAG EVDGARVHDF * 

moos/aoos 95.9% identity over a 220 aa overlap 



10 20 30 40 50 60 

m003 . pep "VVFVAEG I FGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLG FARQRFVSXA 

"""""IIIMIIII lillllllllllllMIIIII I III 11 I I II |. I 

a003 MWFVAEGIFGRAVLGNLVLLFGQGAFEFGVTRFFIRCRVEAFALRCGLGFARQRF^^ 

10 20 30 40 50 60 
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.70 80 90 100 110 120 

m003 . pep DVDVAVAVGVFNQWLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGWH 

I - I I I I I I I M I I M 1 I I I I I I I I I I M I I I i i I I I I I I I I I I I I I I I I t I I I 

a003 DIDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGGGDDGFFGGVGWH 
70 80 90 100 110 120 

130 140 150 160 170 180 

m003 . pep AAAVLRTGWALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARIF 

I I I I I I I M 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t M I I I I I I I I Ill 

a003 AAAVLRTGWALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARIF 

130 140 150 160 170 180 

190 200 210 220 

m003 . pep RVAVGVAGYRVNHAVDALEIGFQAPEAAXGEVNGARVHDFX 
MIMIMMIIIIIIIillllllllM 111:11111111 
a003 RVAVGVAGYRVNHAVDALEIGFQAPEAAAGEVDGARVHDFX 

190 200 210 220 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 003 shows 88.6% identity over a 219 aa overlap with a predicted ORF (ORF 003.ng) 
from N. gonorrhoeae: 

m003/g003 



10 20 30 40 50 60 

m003 . pep MWFVAEGIFGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLGFARQRFVSXA 
<IIIMII:lllllll:l IMIIIIIIMIIIIIIIIIIIIIIII |:||||||||: | 
g003 MWFVAEGVFGRAVLGHLVLLFGQGAFEFGVTRFFIRCRVEAFALRCGFGFARQRFVGFA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m003 . pep DVDVAVAVGVFNQWLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGWH 

1)1 IIIIIIIIMII:i :MIMIIIIIIIIIIII llllll llllll 

g003 DVDVAVAVGVFNQWLMVFLGWEVFQRFVFNNEGQLVFLLLAFEGGGDDGFFGGVGWH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m003 . pep AAAVLRTGWALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARIF 

> N II I : II I : I II I II II I II II I II II I I : I :: I I II I I | : f : | 

g003 AAAVLRAGWTLFVEAGRINDAEIILQDWQAEFVGIVGHFDGLGMTRMAVGHFFV-RVF 

130 140 150 160 170 180 

190 200 210 220 

m003 . pep RVAVGVAGYRVNHAVDALEIGFQAPEAAXGEVNGARVHDFX 

IIMIIUIIIIItlllllMIIII:!! Illlllllll 
g003 RVAVGVTGYRVNHAVDALEIGFQAPKAAAGEVNGARVHDC 

190 200 210 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 13>: 

g004 . seq 

1 ATGgtagAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GCGCCCATGC CAACAagtga gccaAAtgtT CGGCGGCAGG GCCTacgatT 

101 TCCGCGCCGA TAAagcggcc gGTGgctTTT tcgGCataca ggcgcaTatg 

151 gCCTTTGTTT ACCAgcatca cgcggctgcg accttgaTTT TTGAACGATA 

201 CTTCGCCgaT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCgga ctggtaaACA CCACGCCAAT 

301 GGTgctgcgg cGCAAACCGC TGCCGATATt cgGgtagcgg ccccgcgtta 

351 ttgcccggca atcttacctt ggtcggcggc ttcatGCAGC AGGGGCagtt 

4 01 ggttggacgc gtcgcccgca ataAAGATAT GCGGAATgct ggtCTGCATg 

4 51 gtCAGCGGAT CGGCAACGGG tacgccgcgc gcgtctttgT CGATATTGAT 

501 GTTTTCCAAA CCGATATtgT CAACGTTCGG ACGGCgACCT ACGGCTGCCA 
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551 ACATATATTC GGCAACAAAT ACGCCTTTTT CGCCATCCTG CTCCCAATGG 

601 ACTtCtACAT TGCCGTCTGC GTCGAGTTTG ACCTCGGTTT TAGCATCCAG 

651 ATGCAGTTTC AATtCtTCTC CGAACACGGC TTTCGCCTCG TCTGAAACAA 

701 CGGGGTCGGA AATGCCGCCG ATGATTCCGC CCAAACCGAA AATTTCAACT 

751 TTCACACCCA AACGGTGCAA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 14; ORF 004.ng>: 

g004 .pep 

1 MVERHIQHLR NGHLHLMRPC QQVSQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVYQHHAAA TLIFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAN 

101 GAAAQTAADI RVAAPRYCPA ILPWSAASCS RGSW LDASPA IKICGMLVCM 

151 VSGSATGTPR ASLSILMFSK PILSTFGRRP TAANIYSATN TPFSPSCSQW 

201 TSTLPSASSL TSVLASRCSF NSSPNTAFAS SETTGSEMPP MIPPKPKIST 

251 FTPKRCNA* 



The following partial DNA sequence was identified in K meningitidis <SEQ ED 15>: 

ia004 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCAGG GCCTACGATT 

101 TCCGCGCCGA TAAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTC ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAGAC CGACAAAGCC GATTTGCGGA CTGGTAAACA CCACGCCGAT 

301 GGTGCTGCGC CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCAGCTTC ATGCAGCAGA GGCAGTTGGT 

4 01 TGGACGCATC GCCTGCGATG AAGATATGCG GAATACTGGT CTGCATGGTC 

451 AGCGGGTCGG CAACAGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATATT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCCACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCATC GAGTTTGACC TCGGTTTTAG CATCCAGATG 

651 CAGTTTCAAT TCTTCGCCGA ACACGGCGTT CGCCTCGTCT GAAACGACGG 

701 GGTCGGAAAT GCCGCCGATG ATTCCGCCCA AACCGAAAAT TTCAACTTTC 

751 ACGCCCAAAC GGTGCAATGC CTGA 

This corresponds to the amino acid sequence <SEQ ID 16; ORF 004>: 

m004 .pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVHQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAD 

101 GAAPQTAADI RVAAALSPAI LPWSAASCSR GSW LDASPAM KICGILVCMV 

151 SGSATGTPRA SFSILIFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLT SVLASRCSFN SSPNTAFASS ETTGSEMPPM IPPKPKISTF 

251 TPKRCNA* 



a004 . seq 








1 


ATGGTAGAAC 


GGCATATCCA 


GCATTTGCGG 


51 


GTGCCCAAGC 


CAACAGGTGC 


GCCAAATGTT 


101 


TCTGCGCCGA 


TGAAGCGGCC 


GGTGGCTTTT 


151 


GCCTTTGTTT 


ACCAGCATCA 


CGCGGCTGCG 


201 


CTTCGCCGAT 


GACAAATTCG 


TCGGCTTGGT 


251 


TATTTCAAAC 


CGACAAAGCC 


GATTTGCGGA 


301 


GGTGCTGCGG 


CGCAAACCGC 


CGCCGATATT 


351 


GCCGGCAATC 


TTGCCTTGGT 


CGGCGGCTTC 


401 


TGGACGCGTC 


GCCCGCAATA 


AAGATATGCG 


451 


AGCGGATCGG 


CAACGGGTAC 


GCCGCGCGCA 


501 


TTCCAAACCG 


ATATTGTCAA 


CGTTCGGACG 


551 


TATATTCGGC 


AACAAATACG 


CCTTTTTCGC 


601 


TCTACATTGC 


CGTCTGCGTC 


GAGTTTGGCC 


651 


CAGTTTCAAT 


TCTTCACCGA 


ACACGGCTTT 


701 


GGTCGGAAAT 


GCCGCCGATG 


ATGCCACCCA 


751 


ACGCCCAAAC 


GGTGCAATGC 


CTGA 
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This corresponds to the amino acid sequence <SEQ ID 18; ORF 004.a>: 

a004 .pep 

MVERHIQHLR NGHLHLMCPS QQVRQMFGGR TYDFCADEAA GGFFGIQAHM 

AFVYQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGEHYAD 

GAAAQTAADI RVAAALSPAI LPWSAASCSR GSWL DASPAI KICGILVCIV 

SGSATGTPRA SFSILMFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

STLPSASSLA SVLASKCSFN SSPNTAFASS ETTGSEMPPM MPPKPKISTF 
TPKRCNA* 



1 
51 
101 
151 
201 
251 



m004/a0O4 94.9% identity over a 257 aa overlap 



m004 .pep 
a004 

in004 .pep 
a004 

in004 .pep 
a004 

m004 .pep 
a004 

ni004 .pep 
a004 



10 20 30 40 50 60 

MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 

HI Mlillllllllllllllllll:!)] II:IIIIII|||IMIII:| 

MVERHIQHLRNGHLHLMCPSQQVRQMFGGRTYDFCADEAAGGFFGIQAHMAFVYQHHAAA 

10 20 30 40 50 60 

80 90 100 110 120 
ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHADGAAPQTAADIRVAAALSPAI 
>II>IIIIIIIIIIMIIIIIIII||]!||||M|:|:j|||| ||||t| 

ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGEHYADGAAAQTAADIRVAAALSPAI 
70 80 90 100 110 120 

130 140 150 160 170 180 

LPWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILIFSKPILSTFGRRPT 

l>lllllllltlllllll|:|||MM|:||||||MIIIIII||:|| |||||| 

LPWSAASCSRGSWLDASPAIKICGILVCIVSGSATGTPRASFSILMFSKPILSTFGRRPT 

130 140 150 160 170 180 

190 200 210 220 230 240 

AASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPPM 
llll>NlilIllllliflllMIIIIII:|||||:|||)|||||||||}||(,||j,,, 

AASIYSATNTPFSPSCSQWTSTLPSASSLASVLASKCSFNSSPNTAFASSETTGSEMPPM 
190 200 210 220 230 240 

250 

IPPKPKISTFTPPCRCNAX 
: I I I I I I I M I I I I I t I I 
MPPKPKISTFTPKRCNAX 
250 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORT from K gonorrhoeae 

ORF 004 shows 93.4% identity over a 258 aa overlap with a predicted ORF (ORF 004.ng) 
from AT. gonorrhoeae: 

m004/g004 

10 20 30 40 50 60 

m004 . pep MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 
IIIIIMMIIIIIMI 1 III ltlllllill||||||||||MII|]tll:llllll 
g004 MVERHIQHLRNGHLHLMRPCQQVSQMFGGRAYDFRADKAAGGFFGIQAHMAFVYQHHAAA 

10 20 30 40 50 60 

70 80 90 100 110 119 

m004 . pep ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHADGAAPQTAADIRVAAA-LSPA 

= III II I II I II I IMIII III Hill I: MM IN I I I 

g004 TLIFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHANGAAAQTAADIRVAAPRYCPA 

■^0 80 90 100 110 120 

120 130 140 150 160 170 179 

m004.pep ILPWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILIFSKPILSTFGRRP 
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l)lllllllllllllll|||:ltll:|||||ini|||||ij:|||:|||{||||||||| 
g004 ILPWSAASCSRGSWLDASPAIKICGMLVCMVSGSATGTPE^SLSILMFSKPILSTFGRRP 
130 140 150 160 170 180 

180 190 200 210 220 230 239 

m004.pep TAASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPP 

lll = )IIIIM)lt Illlllllllllllllllllllllllt Mill 

g004 TAANIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPP 
190 200 210 220 230 240 

240 250 
m004 .pep MIPPKPKISTFTPKRCNAX 

I 1 I I I I I I I I I I I I I I I I 
g004 MIPPKPKISTFTPKRCNA 
250 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 19>: 

9OO5 . seq 

1 ATGGGGATGG ACAATATTGA TATGTTCATG CCTGAACAAG AGGAAATCCA 

51 ATCAATGTGG AAAGAAATTT TACTGAATTA CGGTATTTTC CTGCTCGAAC 

101 TGCTTACCGT GTTCGGCGCA ATTGCGCTGA TTGTGTTGGC TATCGTACAG 

151 AGTAAGAAAC AGTCGGAAAG CGGCAGTGTC GTACTGACAG ATTTTTCGGA 

201 AAATTATAAA AAACAGCGGC AATCGTTTGA AACATTCTTT TTAAGCGAGG 

251 AAGAGACAAA ACATCAGGAA AAAAAAGAAA AGAAAAAGGA AAAGGCGGAA 

301 GCCAAAGCAG AGAAAAAGCG TTTGAAGGAG GGCGGGGAGA AATCTGCCGA 

351 AACGCAAAAA TCCCGCCTTT TTGTGTTGGA TTTTGACGGC GATTTGTATG 

401 CACACGCCGT AGAATCCTTG CGTCATGAGA TTACGGCGGT GCTTTTGATT 

451 GCCAAGCCTG AAGATGAGGT TCTGCTCAGA TTGGAAAGTC CGGGCGGCGT 

501 GGTTCACGGT TACGGTTTGG CGGCTTCGCA GCTTAGGCGT TTGCGCGAAC 

551 GCAATATTCC GCTGAccgtC gccgTCGATA AGGTCGCGGC AAGCGgcggc 

601 tatatgatgg cgtgtgtgGC GGATAAAATT GTTTCCGCtc cgtttgcggt 

651 catcggttcg gtgggtgtgg tgGcggaagt gcCGAATATC CAccgCctGT 

701 TGAAAAAACA TGATATTGAT GTGGATGTGA TGACGGCGGG CGAATTTAAG 

751 CGCACGGTTA CTTTTATGGG TGAAAATACG GAAAAGGGCA AACAGAAATT 

801 CCGGCAGGAA CTGGAGGAAA CGCATCAGTT GTTCAAGCAG TTTGTCAGTG 

851 AAAACCGCCC CGGGTTGGAT ATTGAAAAAA TAGCGACGGG CGAGCATTGG 

901 TTCGGCCGGC AGGCGTTGGC GTTGAACTTG ATTGACGAGA TTTCGACCAG 

951 TGATGATTTG TTGTTGAAAG CGTTTGAAAA CAAACAGGtt aTCGAAGTGA 

1001 AATATCAGGA GAAGCGAAGC CTGATCCAGC GCATTGGTTT GCAGGCGGAA 

1051 GCTTCCGTTG AAAAGTTGTT TGCCAAACTT GTCAACCGGC GAGCGGATGT 

1101 GATGTAG 

This corresponds to the amino acid sequence <SEQ ID 20; ORF 005 .ng>: 

gOOS.pep 

1 MGMDNIDMFM PEQEEIQSMW KEILLNYG IF LLELLTVFGA lALIV LAIVO 

51 SKKQSESGSV VLTDFSENYK KQRQSFETFF LSEEETKHQE KKEKKKEKAE 

101 AKAEKKRLKE GGEKSAETQK SRLFVLDFDG DLYAHAVESL RHEITAVLLI 

151 AKPEDEVLLR LESPGGWHG YGLAASQLRR LRERNIPLTV AVDKVAASGG 

201 YMMACVAD KI VSAPFAVIGS VGWAE VPNI HRLLKKHDID VDVMTAGEFK 

251 RTVTFMGENT EKGKQKFRQE LEETHQLFKQ FVSENRPGLD lEKIATGEHW 

301 FGRQALALNL IDEISTSDDL LLKAFENKQV lEVKYQEICRS LIQRIGLQAE 

351 ASVEKLFAKL VNRRADVM* 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 21>: 

m005 . seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AwAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 CACAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 
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301 GCAGAGAAAA A.CGTTTGAA GGAGGGTGGG GAGAAATCTG CCGAAACGCA 

351 nAAATCACGC CTTTTTGTGT TGGANNNNNN NNNNNNNNNN NNNNNNNNNN 

4 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGCGAGCGG CGGTTATATG 

601 ATGGCGTGTG TGGCGGATAA AATTGCTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTGGTGGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

701 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

7 51 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGC7VA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGA/WiAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGT A 

1101 G 

This corresponds to the amino acid sequence <SEQ ID 22; ORF 005>: 

m005 .pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LIV LAIVQSK 

51 KQSXSGSWL TDFSENYKKQ RQSFEAFFLS GEEAQHQEKE EKKKEKAEAK 

101 AEKXRLKEGG EKSAETXKSR LFVLXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXASGGYM 

201 MACVAD KIAS APFAIVGSVG WAEV PNIHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 

301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 

351 VEKLFAKLVN RRADVM* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23>: 

a005 . seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AAAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 CAAAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 

301 GCAGAGAAAA AGCGTTTGAA GGAGGGTGGG GAGAAATCTT CCGAAACGCA 

351 AA/VATCCCGC CTTTTTGTGT TGGATTTTGA CGGCGATTTG TATGCACACG 

401 CCGTAGAATC CTTGCGTCAT GAGATTACGG CGGTGCTTTT GATTGCCAAG 

451 CCTGAAGATG AGGTTCTGCT TAGATTGGAA AGTCCGGGCG GCGTGGTTCA 

501 CGGTTACGGT TTGGCGGCTT CGCAGCTTAG GCGTTTGCGC GAACGCAATA 

551 TTCCGCTGAC CGTCGCCGTC GATAAGGTGG CGGCGAGCGG TGGTTATATG 

601 ATGGCGTGTG TGGCGGATAA AATTGTTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTTGTAGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

701 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

751 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGTA 

1101 G 

This corresponds to the amino acid sequence <SEQ ID 24; ORF 005. a>: 

a005 .pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LIV LAIVOSK 

51 KQSESGSWL TDFSENYKKQ RQSFEAFFLS GEEAKHQEKE EKKKEKAEAK 

101 AEKKRLKEGG EKSSETQKSR LFVLDFDGDL YAHAVESLRH EITAVLLIAK 

151 PEDEVLLRLE SPGGWHGYG LAASQLRRLR ERNIPLTVAV DKVAASGGYM 

201 MACVADK IVS APFAIVGSVG WAEV PNIHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 
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301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 
351 VEKLFAKLVN RRADVM* 



moo5/aoo5 79.2% identity over a 366 aa overlap 



mOOS.pep 
a005 

mOOS , pep 
a005 

in005,pep 
a005 

mOOS.pep 
a005 

mOOS.pep 
aOOS 

mOOS.pep 
aOOS 



10 20 30 40 50 60 

MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSWL 
IfllllllllllilllllltlllllllllllllllM illllllllll IIMII 

MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSWL 
10 20 30 40 50 . 60 

70 80 90 100 110 120 

TDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKXRLKEGGEKSAETXKSR 

lllllllllllllllllllllllhlllllllltlllllllM lllllllll: 

TDFSENYKKQRQSFEAFFLSGEEAKHQEKEEKKKEKAEAKAEKKRLKEGGEKSSETQKSR 

'?0 80 90 100 110 120 

130 140 150 160 170 180 

LEVI^XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX^^ 

LFVLDFDGDLYAHAVESLRHEITAVLLIAKPEDEVLLRLESPGGWHGYGLAASQLRRLR 
130 140 150 160 170 180 

190 200 210 220 230 240 

XXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGWAEVPNIHRLLKKHDIDVD 

' l)INIIIIIIIIt:||||[|||tltlllMI Mill 

ERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAIVGSVGWAEVPNIHRLLiCKHDIDVD 

190 200 210 220 230 240 

250 260 270 280 290 300 

VMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 
NIIIMNIMIMIIMIIIIIDIIIIIlllMllllilMllMiiii 

VMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 
250 260 270 280 290 300 

310 320 330 340 350 360 

RQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKLVN 
IKII MllllliliMlllllllllllillllllllllllliliiiiiii,], 

RQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKLVN 
310 320 330 340 350 360 



mOOS.pep 



aOOS 



RRADVMX 
I I 1 I I I t 
RRADVMX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 005 shows 77.0% identity over a 366 aa overlap with a predicted ORF (ORF OOS.ng) 
from N. gonorrhoeae: 

m005/g005 



10 20 30 40 SO 

mOOS.pep MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSV 

If itiiiiiiiiiiiiiii I inn nnininii nii 

gOOS MGMDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSV 
10 20 30 40 50 60 

60 70 80 90 100 HO 

mOOS . pep VLTDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKKRLKEGGEKSAETXK 

mill II Ml III II inn I 1 1 : : 1 1 1 1 n ii 1 1 1 1 n n n n n 1 1 1 n n n i 

g005 VLTDFSENYKKQRQSFETFFLSEEETKHQEKKEKKKEKAEAKAEKKRLKEGGEKSAETQK 
''O 80 90 100 110 120 
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120 130 140 150 160 170 

m005 . pep SRLFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
mill : 

SRLFVLDFDGDLYAHAVESLRHEITAVLLIAKPEDEVLLRLESPGGVVHGYGLAASQLRR 
130 140 150 160 170 180 

180 190 200 210 220 230 

m005 . pep XXXXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGWAEVPNIHRLLKKHDID 

• I III III mil II: ::imilllll|||||imiM 

5005 LI^ERNrPLTVAVDKVAASGGYMMACVADKIVSAPFAVIGSVGWAEVPNIHRLLKKHDID 
190 200 210 220 230 240 

240 250 260 270 280 290 

in005 . pep VDVMTAGEFKRTVTFWGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHW 

Kiiiiiiiiiiiiiimiimimiiiiiiiiiimiiiiii mi::iiim 

g005 VDVMTAGEFKRTVTFWGENTEKGKQKFRQELEETHQLFKQFVSENRPGLDIEKIATGEHW 
250 260 270 280 290 300 

300 310 320 330 340 350 

m005 . pep FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKL 

'"'iiimiiiiimmiii m i m m : m m m 1 1 1 1 1 1 1 1 1 1 1 1 

g005 FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKRSLIQRIGLQAEASVEKLFAKL 
310 320 330 340 350 360 

360 

m005 , pep VNRRADVMX 
III III III 
g005 VNRRADVMX 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 25>: 

gOOS. seq 

1 ATGCTGCTGG TGCTggaatt ttggttCGGc gtGtCGGCGG TGGGCatact 

51 tgCGTTGTTT TTATGGCttt TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC gcgACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GTACCGCCAT TACGGACTGG TTTCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCG 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACGGCA GCGCGGGGCA TATTTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATG AGTTTGGACG ATGTGCCGCG ATTGGTCGAA CAATATTCCA 

401 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 

451 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ED 26; ORF 006.ng>: 

g006 .pep 

1 MLLVLEFWFG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 

51 KGDERQLYRH YGLVSRLRVL ISNREAFGYL CVGAAMGILF GFAFVMM TLK 

101 GYGSAGHIYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 

151 AGT* 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 27>: 

m006.seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACG GGCGGCAGCT GTACCGCCAT TACGGACTGG TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 
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351 GTTTGCCATG AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 
401 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 
451 GCCGGAACTTGA 



This corresponds to the amino acid sequence <SEQ ID 28; ORF 006>: 

m006 .pep 

1 MLLVLEFWVG VSAVGIIiALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 

51 KGDRROLYRH YGLLARLRVL ISNREAFGYL CVGTAMGILF GFAFVM MTLK 

101 GYSSAGHVYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 

151 AGT* 



The following partial DNA sequence was identified in K meningitidis <SEQ E) 29>: 

a006. saq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAAGAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GGACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATA AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGAAACG GAACATCAAA 

4 51 GCCGGAACTT GA 



This corresponds to the amino acid sequence <SEQ ID 30; ORF 006.a>: 

a006 .pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLKN SLERDNHFIR 

51 KGDERQLDRH YGLLARLRVL ISNREAFGY L CVGTAMGILF GFAFVMM TLK 

101 GYSSAGHVYS VGTYLWMFAI SLDDVPRLVE QYSNLKDIGQ RIEWSKRNIK 

151 AGT* 



m006/a006 96,7% identity over a 153 aa overlap 

10 20 30 40 50 60 

mOOe.pep MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDRRQLYRH 

t!lllllllll)lllltlltllllll||j|||||||||: Iiniilrilt )| 

a006 MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERDNHFIRKGDERQLDRH 

10 20 30 40 50 60 

'^0 80 90 100 110 120 

m006 . pep YGLLARLRVL XSNREAFGYLCVGTAMG I LFGFAFVMMTLKGYSSAGHVYS VGTYLWMFAM 

XIIXI Illllllilllliill Ililllllllllll: 

a006 YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAI 

70 80 90 100 110 120 

130 140 150 

mOOG.pep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
I I I I I I I I I I I M I I I i I I I I I I I I : I I I I I I I I 
a006 SLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 

130 140 150 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 006 shows 95.4% identity over a 153 aa overlap with a predicted ORF (ORF 006.ng) 
from A^. gonorrhoeae: 

m006/g006 

10 20 30 40 50 60 

m006 . pep MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDRRQLYRH 
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llllllll IMIIMIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIhllllll 

g 0 0 6 MLLVLEFWFGVSAVGI LALFLWLLPRFAAI S ENLYFRLNNSLERDNH FI RKGDERQLYRH 

10 20 30 40 50 60 

70 80 90 100 110 120 

mO 0 6 . pep YGLLARLRVLI SNREAFGYLCVGTAMGI LFGFAFVMMTLKGYSS AGHVYSVGTYLWMFAM 

III^^IIIMIIIIIIIIII|||:||||||||||||||||||:||||:||||{||||{|{ 
g006 YGLVSRLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSAGHIYSVGTYLWMFAM 

70 BO 90 100 110 120 

130 140 150 

m006 . pep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

Mlllllllllllllllllllllllllllllll 

9II8 SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGT 
130 140 150 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 3 1>: 

g006-i.seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCAAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCCGGTCG GACTGGAAAA CCTTTTGATG CTGGGGTATC 

101 CGGTGTTTGG CGGCTGGGCG ATTAATGCCG TGATTGCGGG GAGGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTATTT TTGATGTGGC TGGTCGGTGC 

201 GGCACGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTTGTGTTG GAACAACGGC AGCGGCAAGT CCCGCATTCA 

301 GCGGTAACTG CACGGGTTGC CCTGTCGCGT GAATTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

401 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

451 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGT^AAAGG CGACGAGCGG CAGCTGTACC GCCATTACGG ACTGGTTTCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 

651 CGGCGCGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

701 TCAAAGGCTA CGGCAGCGCG GGGCATATTT ATTCGGTCGG CACTTATCTG 

751 TGGATGTTTG CCATGAGTTT GGACGATGTG CCGCGATTGG TCGAACAATA 

801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

851 TCAAAGCCGG AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 32; ORF 006-l.ng>: 

g006-l.pep 

1 MWKMLKHIAK THRKRLIGTF SPVGLENLLM LGYPVFGGWA INAVIAGRVW 

SI Q ALLYALWF LMWLVGAA RR lADTRTFTRI YTEIAVPWL EQRQRQVPHS 

101 AVTARVALSR EFVSFFEEHL PIA ATSWSI FGACIMLLVL E FWVGVSAVG 

151 ILALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDER QLYRHYGLVS 

201 RLRVLISNRE AFGY LCVGAA MGILFGFAFV M MTLKGYGSA GHIYSVGTYL 

251 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 33>: 

m006-l . seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCCAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCTGGTCG GACTGGAAAA CCTTTTGATG CTGGTGTATC 

101 CGGTGTTTGG CGGCCGGGCG ATCAATGCCG TGATTGCGGG GGAGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTGCTT TTGATGTGGC TGGTCGGTGC 

201 GGTGCGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTCGTGTTG GAACAGCGGC AGCGACAAGT CCCGCATTCG 

301 GCGGTAACTG CGCGGGTTGC CCTGTCGCGT GAGTTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

4 01 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACCGGCGG CAGCTGTACC GCCATTACGG ACTGCTTGCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 
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651 CGGCACGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

701 TCAAAGGCTA CAGCAGCGCG GGGCATGTCT ATTCGGTCGG CACTTATCTG 

751 TGGATGTTTG CCATGAGTTT GGACGACGTG CCGCGATTGG TCGAACAATA 

801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

851 TCAAAGCCGG AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 34; ORF 006- 1>: 



mO 06-1 .pep 



1 
51 
101 
151 
201 
251 



MWKMLKHIAQ THRKRLIGTF SLVGLENLLM LVYPVFGGRA INAVIAGEVW 
QALLYALWL LMWLVGAV RR lADTRTFTRI YTEIAVPVVL EQRQRQVPHS 
AVTARVALSR EFVSFFEEHL PI AATSWSI FGACIMLLVL E FWVGVSAVG 
ILALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDRR QLYRHYGLLA 
RLRVLISNRE AFGY LCVGTA MGILFGFAFV M MTLKGYSRA GHVYSVGTYL 
WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 



moo6-i/goo6-i 95.5% identity in 288 aa overlap 



m006-l .pep 
g006-l 



m006-l .pep 
g006-l 



m006-l .pep 
g006-l 



mOOe-l .pep 
g006-l 



m006-l .pep 
g006-l 



10 20 30 40 50 60 

MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALWL 
ll>>ltlll:lllllll||)| lllllllll llllli lllllllhlllllllllii: 

MWKMLKHIAKTHRKRLIGTFSPVGLENLLMLGYPVFGGWAINAVIAGRVWQALLYALWF 
10 20 30 40 50 60 

70 80 90 100 110 120 

LMWLVGAVRRIADTRTFTRIYTEIAVPWLEQRQRQVPHSAVTARVALSREFVSFFEEHL 

NIMIIHMIMIIIIIil Ill Illllllltltllllllillllllll 

LMWLVGAARRIADTRTFTRIYTEIAVPWLEQRQRQVPHSAVTARVALSREFVSFFEEHL 

''O 80 90 100 110 120 

130 140 150 160 170 180 

PIAATSWSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 

■"I f>> III III KM I III II III II II. II II III II Mill I III I 

PIAATSVVSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 

130 140 150 160 170 180 

190 200 210 220 230 240 

NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
l>N>lll:lllllllll::|||||[)||j]|||||||:|||]|||,,,, 

NHFIRKGDERQLYRHYGLVSRLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSA 
190 200 210 220 230 240 

250 260 270 280 289 

GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

ll'OIIIII I lltlllllMIIIIIIII Ill 

GHIYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 35>: 

a006-l ^'=^" /i^a-*-!--; ^ 



, seq 
1 



51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



(partial) 

.AGCCAAAACC ACCGCAAGCG ATTGATTGGC ACATTTTTTC TGGTCGGACT 
GGAAAACCTT TTGATGCTGG TGTATCCGGT GTTTGGCGGC TGGGCGATTA 
ATGCCGTGAT TGCGGGGCAG GCGTGGCAGG CGTTGCTGTA CGCTTTGGTT 
GTGCTTTTGA TGTGGCTGGT CGGTGCGGCG CGGCGGATTG CCGATACGCG 
CACGTTTACG CGGATTTATA CCGAAATCGC CGTGCCGGTT GTGTTGGAAC 
AGCGGCAGCG GCAAGTCCCG CATTCGGCGG TAACTGCGCG GGTTGCCCTG 
TCGCGTGAGT TTGTCAGCTT TTTTGAAGAA CACCTGCCGA TTGCCGCGAC 
ATCCGTCGTA TCCATATTCG GCGCGTGCAT CATGCTGCTG GTGCTGGAAT 
TTTGGGTCGG CGTGTCGGCG GTGGGCATAC TTGCGTTGTT TTTATGGCTT 
TTGCCACGTT TTGCCGCCAT CAGCGAAAAC CTGTATTTCC GCCTGAAGAA 
CAGCTTGGAA CGCGACAACC ACTTTATCCG AAAAGGCGAC GAGCGGCAGC 
TGGACCGCCA TTACGGACTG CTTGCGCGCC TGCGTGTGCT GATTTCCAAC 
CGCGAAGCCT TCGGCTATCT CTGCGTCGGC ACGGCGATGG GTATTTTGTT 
CGGCTTTGCT TTTGTGATGA TGACGCTCAA AGGCTACAGC AGCGCGGGGC 
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701 ATGTCTATTC GGTCGGCACT TATCTGTGGA TGTTTGCCAT AAGTTTGGAC 
751 GACGTGCCGC GATTGGTCGA ACAATATTCC AATTTGAAAG ACATCGGACA 
801 ACGGATAGAG TGGTCGAAAC GGAACATCAA AGCCGGAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 36; ORF 006-1. a>: 

a006-l.pep (partial) 

1 . . SQNHRKRLIG TFFLVGLENL LMLVYPVFGG WAINAVIAGQ AW QALLYALV 
51 VLLMWLV6AA RRIADTRTFT RIYTEIAVPV VLEQRQRQVP HSAVTARVAL 

101 SREFVSFFEE HLPI AATSW SIFGACIMLL VL EF WVGVSA VGILALFLWL 

151 LPRFAAISEN LYFRLKNSLE RDNHFIRKGD ERQLDRHYGL LARLRVLISN 

201 REAFG YLCVG TAMGILFGFA FVM MTLKGYS SAGHVYSVGT YLWMFAISLD 

251 DVPRLVEQYS NLKDIGQRIE WSKRNIKAGT * 



a006-l/m006-l 



a006-l.pep 



m006-l 



a006-l ,pep 



m006-l 



a006-l .pep 



m006-l 



a006-l, 



m006-l 



pep 



a006-l.pep 
m006-l 



95.7% identity in 280 aa overlap 

10 . 20 30 40 50 

SQNHRKRLIGTFFLVGLENLLMLVYPVFGGWAINAVIAGQAWQALLYALWL 
'IHIIIIIIII IIIMIIIIIIIIIIM lltMIII::||||||||||| 
MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALWL 
10 20 30 40 50 60 

60 70 80 90 100 110 

LMWLVGAARRIADTRTFTRIYTEIAVPWLEQRQRQVPHSAVTARVALSREFVSFFEEHL 

IINIIt:IIIIIMIIII[ IIIIIIIIIMIMIIIIIIIIIIM li 

LMWLVGAVRRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
70 80 90. 100 110 120 

120 130 140 150 160 170 

PIAATSWSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERD 

N till Illlllllllllllllii Illlllllhllllll 

PIAATSWSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 
130 140 150 160 170 180 

180 190 200 210 220 230 

NHFIRKGDERQLDRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 

■IIMMI: Illlll llllilllllllllllilllllllllllMIII 

NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
190 200 210 220 230 240 

240 250 260 270 280 

GHVYSVGTYLWMFAISLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 

IlilNI I: IIIIIIIIIIIIIIIMI:|||||||| 

GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
250 260 270 280 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 37>: 

gOO? . seq 

1 atgaACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGcgC 

51 CGCcGCTTCT GCCGccgaca acAGCatcat gaCaAAAGGG CAAAAAGTGT 

101 ACGAATCcAa ctGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGtTTC CTccgctTTT CCggtcgGac tgtattatga acaAACCGCa 

201 cgTCCtgctg cacagcatgg tcaaaggcAt cgacgggaca ttcaaagtgg 

251 agcggcaaaa cctacgacgg atttatgCcc gcaaccgcca tcagcgATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 38; ORF 007,ng>: 

gOOT.pep 

1 MNTTRLPTAF ILCCLCAAAS AA DNSIMTKG QKVYESNCIA CHGKKGEGRG 

51 TAFPPLFRSD CIMNKPHVLL HSMVKGIDGT FKVERQNLRR lYARNRHQRC 

101 GHCRRRHLYH ERL* 



The following partial DNA sequence was identified in A'', meningitidis <SEQ ID 39>: 
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m007 . seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTC. 

251 AACGGCAAT^ CCTACAACGG ATTCATGCCC GCAACCGCCA TCAGCGATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 



This corresponds to the amino acid sequence <SEQ ID 40; ORF 007>: 

m007 .pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARNRHQRC 

101 GHCRRRHLYH ERL* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 41>: 

a007.seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GIKQPJ^GQQ CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTC 

251 AACGGCAAAA CCTACAACGG ATTCATGCCC GCCACTGCCA TCAGCGATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 42; ORF 007.a>: 

a007.pep 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARHCHQRC 

101 GHCRRRHLYH ERL* 



m007/aoo7 97.3% identity over a 1 13 aa overlap 

10 20 30 40 50 60 

m007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

X 1111111:1 III! Itlllllill IIIMIIIIIIIII 

MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
10 20 30 40 50 60 



a007 



70 80 90 100 110 

m007 . pep F I MKKPQVLLHSMVKG I NGT IKVXRQNLQR I HARNRHQRCGHCRRRHLYHERLX 

IIIIM IIMIIIIIIIMIIIIIIIIII: lllllll iiii 

FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARHCHQRCGHCRRRHLYHERLX 
■70 80 90 100 110 



a007 



Computer analysis of this amino acid sequence gave the followdng results: 
Homology vnth a predicted ORF from A^. gonorrhoeae 

ORF 007 shov^^s 86.7% identity over a 113 aa overlap with a predicted ORF (ORF 007.ng) 
from A^. gonorrhoeae: 

m007/g007 

10 20 30 40 50 60 

in007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

'"">"">*•" I: Ml Ml I I II I I IM:|IIMIIItlll 1111:111 

g007 MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 

10 20 30 40 50 60 



m007.pep 



70 80 90 100 110 

FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
M : M : M I II I I I II : II : II I I I I : I I : I I I I I I I I I I I II I I I I I I I I 
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g007 CIMNKPHVLLHSMVKGIDGTFKVERQNLRRIYARNRHQRCGHCRRRHLYHERL 
70 80 90 100 110 

The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 43>: 

g007-l . seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGTTTC CTCCGCTTTT CCGGTCGGAC TATATTATGA ACAAACCGCA 

201 CGTCCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAGGC AAAAAAAAC. 

This corresponds to the amino acid sequence <SEQ ID 44; ORF 007-l.ng>: 

g007-l.pep (partial) 

1 MNTTRLPTAF ILCCLCAAAS AA DNSIMTKG QKVYESNCIA CHGKKGEGRG 
51 TAFPPLFRSD YIMNKPHVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKG KKN. . . 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 45>: 

m007-l .seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 
51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA A/VAAACCGCA 
201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 
251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 
301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 
351 CGGAAGCGTT ACCGAAAZ^G ACGTAAAACA GGCAAAAAGC AAAAAAAACT 
401 AA 

This corresponds to the amino acid sequence <SEQ ID 46; ORF 007- 1> 

m007-l .pep 

1 MNTTRLPTAL VLGCFCAAAS AA DNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKS KKN* 

m007-i / g007-i 91.7% identity in 133 aa overlap 

10 20 30 40 50 60 

m007-l . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
M I I I I I I I : : I I : I M I I I I I I I I 1 I I I I I I t I I n : I I I i I I I I I t t I I I I I : | I | 
gOO 7 - 1 MNTTRLPTAFI LCCLCAAAS AADNS IMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRS D 

10 20 30 40 50 60 

70 80 90 100 110 120 

m007-l . pep FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

'W'W' lillMlllllllllllMIIIII Mil 

g007-l YIMNKPHVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

70 80 90 100 110 120 

130 

mO 07-1. pep TEKDVKQAKSKKNX 

111111111:111 
g007-l TEKDVKQAKGKKN 
130 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ED 47>: 

a007-l.seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 
51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
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151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CCACTGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAAAC AAAAAA. . 

This corresponds to the amino acid sequence <SEQ ID 48; ORF 007-l.a>: 

a007-l.pep (partial) 

1 MNTTRLPTAL VLGCLCAAAS AA DNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDN6GGSV TEKDVKQAKN KK. . 

inoo7-i/aoo7-i 98.5% identity in 132 aa overlap 

10 20 30 40 50 60 

m007-l . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
MIMIItllllll:||||||||||||tll[|||||)|||||MIIIIII||||||||t( 
a007-l MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 

70 80 90 100 110 120 

m007-l.pep FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 
MilllltlMlllltllllllllllillllllllMllllilllllllllllilMllj 
a007-l FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 
70 80 90 100 110 120 

130 

mO 07-1. pep TEKDVKQ7VKSKKNX 

I I I I I I t It : I I 
a 0 0 7 - 1 TEKDVKQAKNKK 
130 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 49>: 

g008 . seq 

1 ATGAACAACA GACATTTTGC CGTCAtcgCC TTGGGCAGCA ACCTTGACAA 

51 CCCCGCACAA CAAATacgcg gcgcattaga cgcgctctcg tcccatcctg 

101 acatccggct tgaaCaggtt tcctcactgt aTatgaccgc acctgtcggt 

151 tacgAcaaTC agcccgATTT CATCaatgcc gTCTgcaccg TTTCCACCAC 

201 CtTGGACGGC ATTGcccTGC TTGCCgaACT CAAccgTATC GAAGCCGATT 

251 TCGGACGCGA aCGCAGTTTC CGCAATGCAC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGCC TTACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAATACG GA7VAGGTTGT CGAATTGTCA 

451 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGACA GGTAA 

This corresponds to the amino acid sequence <SEQ ID 50; ORF 008.ng>: 

gOOS.pep 

1 MNNRHFAVIA LGSNLDNPAQ QIRGALDALS SHPDIRLEQV SSLYMTAPVG 

51 YDNQPDFINA VCTVSTTLDG lALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKYGKWELS 

151 KRLGNQGIRL LPDR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 51>: 

m008 .soq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCTCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT TGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 TCTGGACGGC ATTGCCyTGC TTGCCGAACT CAACCGTATC GAGGCTGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GkATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACACsCGAC TcACCtTGCC 
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351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATCCGCCCT TTGGCAGAAA 

401 TCCTCCCTGA TTTTGTTTTA GGAAAACACG GAAAGGTTGC CGMTTGTCA 

4 51 AAACGGyTGG GCAATCAAGG TATCCGTCTT TTACCGGACA GGTAATT 

This corresponds to the amino acid sequence <SEQ ED 52; ORF 008>: 

mOOS.pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG lALLAELNRI EADFGRERSF RNAPRTLXLD 

101 IIDFDGISSD DTRLTLPHPR AHERSFVIRP LAEILPDFVL GKHGKVAELS 

151 KRLGNQGIRL LPDR* 



The following partial DNA sequence was identified in N, meningitidis<SEQ ID 53>: 

aOOa, seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCCCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT CGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 CTTGGACGGC ATTGCCCTGC TTGCCGAACT CAACCGTATC GAAGCCGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGAC TCACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAACACG GAAAGGTTGC CGAATTGTCA 

451 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGATA AGTAA 

This corresponds to the amino acid sequence <SEQ ID 54; ORF 008.a>: 

aOOe.pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG lALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKHGKVAELS 

151 KRLGNQGIRL LPDK* 



moo8/a008 97.6% identity over a 164 aa overlap 

10 20 30 40 50 60 

mOOS.pep MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 

M I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I 1 I I I I t I I I I I I I I I I I I I 

MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 
10 20 30 40 50 60 



a008 



70 80 90 100 110 120 

mOOS.pep VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR 

illllllll Illlllllllll lllllllllllll IMIIMI 

VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHFR 
"70 80 90 100 110 120 



a008 



130 140 150 160 

mOOS.pep AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 

IIMIIIIII:IIIIIIIIIIMIIIIIItlll|{:| 

a008 AHERSFVIRPLAEILPDFILGKHGKVAELSKRLGNQGIRLLPDKX 

130 140 150 160 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 008 shows 92.7% identity over a 164 aa overlap with a predicted ORF (ORFOOS.ng) 
from N. gonorrhoeae: 

m008/g008 

20 30 40 50 60 

mOOe.pep MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 
MII||l||||||||:lMM:|:|||:l||||||||:|:||,|,,|,||||[|,i,.,| 

g008 MNNRHFAVIALGSNLDNPAQQIRGALDALSSHPDIRLEQVSSLYMTAPVGYDNQPDFINA 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ta008 , pep VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR 

Mil 1 I I 1 I I I I I I I I I I M I I 1 I I I I I I I I I I I I I I I I I I I ) I I i 

g008 VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR 
70 80 90 100 110 120 

130 140 150 160 

m008 . pep AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 
lllillflllllllllll:lll:|||:||||||||||tlllllll 
gOOB AHERSFVIRPLAEILPDFILGKYGKWELSKRLGNQGIRLLPDRX 

130 140 150 160 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 55>: 

g009. seq 

1 ATGCCCCGCG CTGCCGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GCGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGAA ACCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTcg cGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AaaaGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 56; ORF 009.ng>: 

g009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARNQSVMAV 

51 QLPLVAFSDK VWAFQAWQ AEIQVFA DGG KTWQKP* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 57>: 

i&009.seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC CGGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 



This corresponds to the amino acid sequence <SEQ ID 58; ORF 009>: 

mOOd.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPPVAFSDK VWAFQAWQ AEIQVFA DGG ICTWQKP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF fi-om N. gonorrhoeae 

ORF 009 shows 97.7% identity over a 86 aa overlap with a predicted ORF (ORF 009.ng) 
fi'om N, gonorrhoeae: 

in009/g009 

10 20 30 40 50 60 

mOO 9 . pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 

MIMIIIIIIII I nil 1111111:1 I Mini 

g009 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARNQSVMAVQLPLVAFSDK 

10 20 30 40 50 60 

70 80 
m009 . pep VWAFQAWQAEIQVFADGGKTWQKPX 
I I I I I I I I t I I I I I I I I I I I I I I I I I I 
g009 VWAFQAVVQTVEIQVFADGGKTWQKPX 
70 80 
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The following partial DNA sequence was identified in K meningitidis <SEQ ID 59>: 

a009.seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTC 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTCTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 60; ORF 009.a>: 

a009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPLVAFSDK VWAFQAVLQ AEIQVFA DGG KTWQKP* 

m009/a009 97.7% identity over a 86 aa overlap 

10 20 30 40 50 60 

m009 . pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 

I >) I I I I I I I I I I I I I I I I I I I I I I M I I I I I t I I I I [ I I I I I I I I II 

a 0 0 9 MPRAAVAFERHHHKSKAEQNTHRRADAE I AEG FAVGNQHTQARKQS VMAVQLP LVAFS DK 

10 20 30 40 50 60 

70 80 
m009 . pep VWAFQAWQAEIQVFADGGKTWQKPX 
IIIIIIIIMIIIIIIIilllllllll 
a009 WVAFQAVLQAEIQVFADGGKTWQKPX 

70 80 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 61>: 

gOlO . seq 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

401 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

451 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

701 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

751 GTGTTGATTA, CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAacgc 

801 cgacggcgaA cgcTTTATGG T^cgctatgc GCcgACCGta aAagaCTTGG 

851 CTTCTCGCga cgtGGTTTCA CgcgcGatgG CGatggaAAt ctatgaaggt 

901 cgcggctgTG GtaaAAAcaA agaCCacgtC TTACTGAAAA TCGACcAtAt 

951 cggtGCAGAA AAAATTATGG AA/yWVCTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCagtttgc cGGTATCGAT CCGATTAAAG ACCCGATTcc ggttgTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCcg aCCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

• 1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 

1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 62; ORF 010.ng>: 

gOlO.pep 
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1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDWS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEVWP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLWF RPTPR* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 63>: 

mOlO . seq (PARTIAL) 

1 . . nTCCAATTAT CCAAATCCGG TCTGAATTGT GCCGTTTTGT CTAAAGTGTT 

51 CCCGACCCGT TCGCATACCG TAGCGGCGCA GGGCGGTATT TCCGCCTCTn 

101 TGGGTAATGT GCAGGAAGAC CGTTGGGACT GGCACATGTA CGATACCGTG 

151 AAAGGTTCCG ACTGGTTGGG CGACCAAGAT GCGATTGAGT TTATGTGCCG 

201 CGCCGCGCCT GAAGCCGTAA TTGAGTTGGA ACACATGGGT ATGCCTTTTG 

251 ACCGTGTGGA AAGCGGTAAA ATTTATCAGC GTCCTTTCGG CGGCCATACT 

301 GCCGAACACG GTAAACGCGC GGTAGAACGC GyCTGTGCGG TTGCCGACCG 

351 TACAGGTCAT GCGATGCTGC ATACTTTGTA CCAACAAAAC GTCCGTGCCA 

401 ATACGCTU^TT CTTTGTGGAA TGGACGGCAC AAGATTTGAT TCGTGATGAA 

451 AACGGCGATG TCGTCGGCGT AACCGCCATG GAAATGGAAA CCGGCGAAgT 

501 TTATATTTTC CACGCTAAAG CTGTGATGTT TGCTACCGGC GGCGGCGGTC 

551 GTATTTATGC GTCTTCTACC AATGCCTATA TGAATACCGG CGATGGTTTG 

601 GGTATTTGTG CGCGTGCAGG TATCCCGTTG GAAGACATGG AATTCTGGCA 

651 ATTCCAGCCG ACCGGCGTGG CGGGTGCGGG CGTGTTGATT ACCGAA 

This corresponds to the amino acid sequence <SEQ ID 64; ORF 010>: 

mOlO.pep (PARTIAL) 

1 . . XQLSKSGLNC AVLSPCVFPTR SHTVAAQGGI SASXGNVQED RWDWHMYDTV 

51 KGSDWLGDQD AIEFMCRAAP EAVIELEHMG MPFDRVESGK lYQRPFGGHT 

101 AEHGKRAVER XCAVADRTGH AMLHTLYQQN VRANTQFFVE WTAQDLIRDE 

151 NGDWGVTAM EMETGEVYIF HAKAVMFATG GGGRIYASST NAYMNTGDGL 

201 GICARAGIPL EDMEFWQFQP TGVAGAGVLI TE. . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65>: 

aOlO.seq 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

451 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAMC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 
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1301 ATGCCGGCGA ACTGACCCGC 

1351 gatggtg;^ ACGTTGATGC 

1401 ATTGCACGCC GGCGTGTTCC 

14 51 GAGAAGTCAT GGCGATTGCC 

1501 AAGAGCAAAG TGTGGAATAC 

1551 CCTAATTGAA GTGGCGAAAG 

1601 AATCACGCGG TGCGCACGCT 

1651 AACTGGATGA AACATACGCT 

17 01 CAAACCGGTG CACACCAAGC 

1751 AGCGCGTTTA TTGA 



CAACGTATCG AGCGTTTGGA CAATCAAACT 
ATTGCGCCGC GAACTGCAAC GCTCCGTACA 
GTACTGATGA GATTCTGAGC AAAGGCGTTC 
GAGCGTGTGA AACGTACCGA AATCAAAGAC 
CGCGCGTATC GAGGCTTTGG AATTGGATAA 
CGACTTTGGT GTCTGCCGT^ GCACGTAAAG 
TCAGACGACC ATCCTGAGCG CGATGATGAA 
GTACCATTCA GATGCCAATA CCTTGTCCTA 
CTTTGAGCGT GGAATACATC AAACCGGCCA 



This corresponds to the amino acid sequence <SEQ ID 66; ORF 010.a>: 

aOlO.pep 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDWS RAMAMEIYEG 

301 RGCGBCNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEVWP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

401 TNSLLDLWF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

4 51 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 

551 NWMKHTLYHS DANTLSYKPV HTKPLSVEYI KPAKRVY* 



moio/aolo 98.7% identity over a 23 1 aa overlap 



10 20 30 

mOlO . pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

lllllllllltllllllllllillllllllll Ml 
aOlO MGFPVRKFDAVIVGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 
10 20 30 40 50 60 

40 50 60 70 80 90 

mOl 0 . pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKI YQRPFG 

I M I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I t M I I i I I I I I j I I 1 M I I I I I t I I 
aOlO QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
70 80 90 100 110 120 

100 110 120 130 140 150 

mOlO . pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

I I I t I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I [ I I 

aOlO GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 
130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO . pep TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 

IMIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIMIII Illllll 

aOlO TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 
190 200 210 220 230 240 

220 230 
mOlO.pep FQPTGVAGAGVLITE 
I : I I M I I I I ] M I I 

aOlO FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEIYEG 
250 260 270 280 290 300 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted QRF from N. gonorrhoeae 

ORF 010 shows 98.7% identity over a 231 aa overlap with a predicted ORF (ORF OlO.ng) 
from A^. gonorrhoeae: 

mO 1 0 . pep/gO 1 0 . pep 

10 20 30 

^010 . pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

I I M I I I I I I i I I I I I I I I I I i I I I I I j I I I I III 
gOlO ^^GFPVRKFDAVIVGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 
10 20 30 , 40 50 60 

40 50 60 70 80 90 

mOlO . pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

III! lllllllllllltlllllllM illlllllllltlllllllllll 

gOlO QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
"70 80 90 100 110 120 

100 110 120 130 140 150 

mOlO . pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

> I illlllllllllllllMllllllllllllliiiiiiiiii 

gOlO GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 
130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO . pep TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 

■NIIIIIIMII MilllllllllllllllllMllltM Illllllll 

gOlO TAMEMETGEVYIFHAICAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 
190 200 210 220 230 240 

220 230 
mOlO .pep FQPTGVAGAGVLITE 
I : I I II I I I II II II 

gOlO FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEIYEG 
250 260 270 280 290 300 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 67>: 

gOlO-l.seq.. 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

401 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

451 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

701 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAACGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTGGTTTCA CGCGCGATGG CGATGGAAAT CTATGAAGGT 

901 CGCGGCTGTG GTAAAAACAA AGACCACGTC TTACTGAAAA TCGACCATAT 

951 CGGTGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTTGC CGGTATCGAT CCGATTAAAG ACCCGATTCC GGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 
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1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 68; ORF 010-1. ng>: 

gOlO-l .pep 

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEVWP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

401 TNSLLDLWF RPTPR* 

gOlO-1 / P10444 

8p|P10444|DHSA_ECOLl SUCCINATE DEHYDROGENASE FIAVOPROTEIN SUBUNIT 

gnl|PID|dl015210 (D90711) Succinate dehydrogenase, flavoprotein [Escherichia coli] gi|l786942 
(AE000175) succinate dehydrogenase flavoprotein subunit [Escherichia coli) Length « 588 

Score = 1073 (495.6 bits). Expect ^ 6.7e-169, Sum P(2) = 6.7e-169 

Identities = 191/303 (63%) , Positives = 238/303 (78%) 

Query: 1 MGFPVRKFDAVIVXXXXXXXXXXXXXSKSCmCAVLSKVFPTRSHTVAAQGGISASLGNV 60 

M PVR+FDAV++ S+SG CA+LSKVFPTRSHTV+AQGGI+ +LGN 

Sbjct; 1 MKLPVREFDAWIGAGGAGMRAALQISQSGQTCALLSKVFPTRSHTVSAQGGITVALGNT 60 

Query: 61 QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 120 

ED W+WHMYDTVKGSD++GDQDArE+MC+ PEA++ELEHMG+PF R++ G+IYQRPFG 
Sbjct: 61 HEDNWEWHMYDTVKGSDYIGDQDAIEYMCKTGPEAILELEHMGLPFSRLDDGRIYQRPFG 120 

Query: 121 GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDL I RDENGDWGV 180 

G + G R A ADRTGHA+LHTLYQQN++ +T F EW A DL+++++G WG 

Sbjct: 121 GQSKNFGGEQAARTAAAADRTGHALLHTLYQQNLKNHTTIPSEWYALDLVKNQDGAWGC 180 

Query: 181 TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAG IPLEDMEFWQ 240 

TA+ +ETGEV F A+A + ATGG GRIY S+TNA++NTGDG+G+ RAG+P++DME WQ 
Sbjct: 181 TALCIETGEVVYFKARATVIATGGAGRIYQSTTNAHINTGDGVGMAIRAGVPVQDMEMWQ 240 

Query: 241 FHPTGVAGAGVL I TEGVRGEGG I LLNADGERPMERYAPTVKDLASRDWS RAMAMEIYEG 300 

FHPTG+AGAGVL+TEG RGEGG LUJ GERFMERYAP KDLA RDW+R++ +EI EG 
Sbjct: 241 FHPTGIAGAGVLVTEGCRGEGGYLLNKHGERFMERYAPNAKDIiAGRDWARSIMIEIREG 300 

Query: 301 RGC 303 
RGC 

Sbjct: 301 RGC 303 

Score = 249 (115.0 bits), Expect = 6.7e-169, Sum P(2) = 6.7e-169 
Identities « 53/102 (51%) , Positives » 62/102 (60%) 

Query: 309 HVLLKIDHIGAEKIMEKLPGIREISIQFAGXXXXXXXXXXXXTTHYMMGGIPTNYHGEW 368 

H LK+DH+G E + +LPGI E+S FA T HYMMGGIPT G+ + 

Sbjct: 310 HAKLKLDHLGKEVLESRLPGILELSRTFAHVDPVKEPIPVIPTCHYMMGGIPTKVTGQAL 369 

Query: 369 VPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWF 410 

+V V GL+A GE AC SVHGANRLG NSLLDLWF 
Sbjct: 370 TVNEKGEDVWPGLFAVGEIACVSVHGANRLGGNSLLDLWF 411 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 69>: 

raOlO-l.seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCAGCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAg 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCGGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 
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451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 



CAACAAAACG 
AGATTTGATT 
AAATGGAAAC 
GCTACCGGCG 
GAATACCGGC 
AAGACATGGA 
GTGTTGATTA 
CGACGGCGAA 
CTTCTCGCGA 
CGCGGCTGCG 
CGGCGCAGAA 
TTCAGTTCGC 
ACTACCCACT 
TGTCGTTCCG 
CGGCAGGTGA 
ACCAACTCCC 
CATGATTAAA 
ATGCAGGTGA 
GATGGTGAAA 
ACTGCACGCC 
GAGAAGTCAT 
AAGAGCAAAG 
CCTGATTGAA 
AATCACGCGG 
AACTGGATGA 
CAAACCGGTG 
AGCGCGTTTA 



TCCGTGCCAA 
CGTGATGAAA 
CGGCGAAGTT 
GCGGCGGTCG 
GATGGTTTGG 
ATTCTGGCAA 
CCGAAGGCGT 
CGCTTTATGG 
CGTTGTTTCC 
GTAAAAACAA 
AAAATTATGG 
CGGTATCGAT 
ATATGATGGG 
CAAGGTGAAG 
GTGCGCTTGT 
TGTTGGACTT 
TTCATCAAAG 
GTTGACCCGC 
ACGTTGATGC 
GGCGTGTTCC 
GGCGATTGCC 
TGTGGAATAC 
GTGGCGAAAG 
TGCGCACGCT 
AACATACGCT 
CACACCAAGC 
TTGATGA 



TACGCAATTC 
ACGGCGATGT 
TATATTTTCC 
TATTTATGCG 
GTATTTGTGC 
TTCCACCCGA 
ACGCGGCGAG 
AACGCTATGC 
CGCGCGATGG 
AGACCATGTC 
AAAAACTGCC 
CCGATTAAAG 
CGGCATTCCG 
ATTACGAAGT 
GCTTCCGTAC 
GGTGGTATTC 
AGCAAAGCGA 
CAACGTATCG 
ATTGCGTCGC 
GTACTGATGA 
GAGCGTGTGA 
CGCGCGTATC 
CGACTTTGGT 
TCAGACGACC 
GTACCATTCA 
CTTTGAGCGT 



TTTGTGGAAT 
CGTCGGCGTA 
ACGCTAAAGC 
TCTTCTACCA 
GCGTGCAGGT 
CCGGCGTGGC 
GGCGGTATTC 
GCCGACCGTA 
CGATGGAAAT 
TTACTGAAAA 
GGGCATCCGC 
ACCCGATTCC 
ACCAATTACC 
GCCTGTAAAA 
ACGGTGCGAA 
GGTAAAGCTG 
CTGGAAACCT 
AGCGTTTGGA 
GAACTGCAAC 
GATTCTGAGC 
AACGTACCGA 
GAGGCTTTGG 
GTCTGCCGAA 
ATCCTGAGCG 
GATATCAATA 
GGAATACATC 



GGACGGCACA 
ACCGCCATGG 
TGTGATGTTT 
ATGCCTATAT 
ATCCCGTTGG 
GGGTGCGGGC 
TGTTGAATGC 
AAAGACTTGG 
CTACGAAGGT 
TCGACCATAT 
GAGATTTCCA 
CGTTGTGCCG 
ACGGCGAAGT 
GGTCTGTATG 
CCGCTTGGGT 
CCGGCGACAG 
TTGCCTGCTA 
CAACCAAACC 
GCTCTGTACA 
AAAGGCGTTC 
AATCAAAGAC 
AATTGGATAA 
GCACGTAAAG 
CGATGATGAA 
CCTTGTCCTA 
AAACCGGCCA 



This corresponds to the amino acid sequence <SEQ ID 70; ORF 010-1>: 

mOlO-l.pep. . 

MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 
GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 
HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 
QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 
ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 
VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDWS RAMAMEIYEG 
RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 
TTHYMMGGIP TNYHGEVWP QGEDYEVPVK GLYAAGECAC ASVHGANRLG 
TNSLLDLWF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 
DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 
KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 
NWMKHTLYHS DINTLSYKPV HTKPLSVEYI KPAKRVY* 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



mOlO-l / gOlO-1 99.5% identity in 410 aa overlap 



10 20 30 40 50 60 

mOlO-l .pep MGPPVRKFDAVIVGGGGAGIJVAM^LSKSta^CAVLSKVPPTRSHT^^ 

IIMllllllltnitllllllllllltnilllllllllllllllt Ilttltll 

gOlO-1 MGETVRKHll^VlVGGGGAGIiRAALQI^KSGI«CAVLSK\^TRSHTV^ 

10 20 30 40 50 €0 

70 80 90 100 110 120 

mO 10 - 1 . pep QEDRWDWHMTOTVKGSDWLGDQimEEMCRAAPEAVIELEHMGMPFDRVESGKiyQRPFG 

tllitlllllllllllliniMlllt IMMIIIIIIilllltllllMI 

gOlO-1 QEDRWDWHMYDTVIUSSDWLGDQPAIEIMCRAAPEAVIELEHMGMPFDKVESGKIYQRPFG 

70 80 90 100 110 120 

130 140 150 160 170 180 

mOlO-1 .pap CHTAEHGKRAVERACAVADRTGHAMLirrLYQGimWNTQFFVETraAQDLIR^ 

IINIinilllllllllllllllllll IIIIIIIIIIIIIIMIIIMMIIII 

gOlO-1 GHTAEHGKRAVERACAYADRTGHi^HTLYQQNVRJttn^FFVEWrAQDL 

130 140 150 160 170 180 

190 200 210 220 230 240 

mOlO-1 .pep TAMEMETGEVYIPHAKAVMFATGGGGRIYASSTNAYMNTGDGl^ICARAGIPIilDME 

■111 It I till llllltllllllllllllliritiMltltllMlilllltl 

gOlO-1 ™«EMETGEVyiFHAKAVMFATGGGGRIYAS3TNAYMNTCDGLGICARAGIPLEI^ 
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190 200 210 220 230 240 

250 260 270 280 290 300 

mOlO-1 . pep FHPTGVAGAGVLITEGVRGE6GXIXNADG£RET4ERYAPTVKDIJISRDWSRAM^ 

IllllllllllllllllllllllllllllllllllllllUlllllllllltltllltll 
gOlO-1 FHPT(^A(;ACra.IT£GVRG&GGXIJ^KADGBR£tlBRYAPTVKDIA^ 

250 260 270 280 290 300 



310 320 330 340 350 360 

mOlO-1 .pep RGCGKNKDinnXKIDHIGAEKIMEKLPGIREISIQPAGIDPIKDPIPVVPTTHYMMCGIP 
llllllllllllllllltllllllllllllltlltllllllllllllllllllllllill 
gOlO-1 RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIFVVPTTHXMMG6IP 

310 320 330 340 350 360 



370 380 390 400 410 420 

mOlO-1 . pep TNYHGEVVVPQGEDYEVPVKGLYAAGECACASVHGAMRIA3TNSZJ:J)LVVFGKA2^ 

illilll)llit::|ltllMllllllllltlltllinill)ltlllll 
gOlO - 1 TNYHGEVVVPQGDE YEVPVKGLYAAGECACASVHGANRI^TNSLLDLVV^ 

370 380 390 400 410 



430 440 450 460 470 480 

mOlO-1 .pep FZKEQSDWKFLPAKAGELTRQRIEIOiDNQTDGENVDJaJUlELQRSVQLHAGVFRT^ 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 71>: 

aOlO-l.seq. . 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

451 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 

1301 ATGCCGGCGA ACTGACCCGC CAACGTATCG AGCGTTTGGA CAATCAAACT 

1351 GATGGTGAAA ACGTTGATGC ATTGCGCCGC GAACTGCAAC GCTCCGTACA 

1401 ATTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC 

14 51 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC 

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTAATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA 

1651 AACTGGATGA AACATACGCT GTACCATTCA GATGCCAATA CCTTGTCCTA 

1701 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGT^TACATC AAACCGGCCA 

1751 AGCGCGTTTA TTGA 



This corresponds to the amino acid sequence <SEQ ID 72; ORF 010-l.a>: 

aOlO-l.pep. . 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 
51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 
101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 
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151 
201 
251 
301 
351 
401 
451 
501 
551 



QQNVRAKTQF 
ATGGGGRIYA 
VLITEGVRGE 
RGCGKNKDHV 
TTHYMMGGIP 
TNSLLDLWF 
DGENVDALRR 
KSKVWNTARI 
NWMKHTLYHS 



FVEWTAQDLI 
SSTNAYMNTG 
GGILLNADGE 
LLKIDHIGAE 
TNYHGEWVP 
GKAAGDSMIK 
ELQRSVQLHA 
EALELDNLIE 
DANTLSYKPV 



191 



RDENGDWGV 
DGLGICARAG 
RFMERYAPTV 
KIMEKLPGIR 
QGDEYEVPVK 
FIKEQSDWKP 
GVFRTDEILS 
VAKATLVSAE 
HTKPLSVEYI 



mOlO-l / aOlO-1 99.3% identity in 587 aa overlap 



TAMEMETGEV 
•IPLEDMEFWQ 
KDLASRDWS 
EISIQFAGID 
GLYAAGECAC 
LPANAGELTR 
KGVREVMAIA 
ARKESRGAHA 
KPAKRVY* 



YIFHAKAVMF 
FHPTGVAGAG 
RAMAMEIYEG 
PIKDPIPWP 
ASVHGANRLG 
QRIERLDNQT 
ERVKRTEIKD 
SDDHPERDDE 



10 20 30 40 50 60 

aOl 0 - 1 . pep MGFPVRKFDAVI VGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

llllllllllllilllllllll lllllllllllllllllllllllllllllllllllll 
aO 1 0 - 1 MGFPVRKFDAVI VGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

10 20 . 30 40 50 60 



70 80 90 100 110 120 

aO 1 0 - 1 . pep QEDRWDWHMYDTVKGSDWLGDQDAI EFMCRAAPEAVI ELEHMGMPFDRVESGKI YQRPFG 

llllllllllllllllllllllllllllllilllllllllllllllllllllllllllll 
mOlO - 1 QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVI ELEHMGMPFDRVESGKI YQRPFG 

70 80 90 100 110 120 

130 140 150 160 170 180 

aOlO - 1 . pep GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

raO 1 0 - 1 GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 
130 140 150 160 170 180 

190 200 210 220 230 240 

aOl 0 - 1 . pep TAMEMETGEVYIFHAKAVMFATGGGGRI YASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 

illllllllllMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

mOlO- 1 TAMEMETGEVYIPHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 
190 200 210 220 230 240 

250 260 270 280 290 300 

aOlO-1 .pep FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDIiASRDWSRAMAMEIYEG 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

mOlO - 1 FHPTGVAGAGVLITEGVRGEGGILIJJADGERFMERYAPTVKDIASRDVVSRAMAMEIYEG 
250 260 270 280 290 300 

310 320 330 340 350 360 

aOlO - 1 . pep RGCX3KNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 

IIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

mOlO-l RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 
310 320 330 340 350 360 

370 380 390 400 410 420 

aOl 0 - 1 . pep TNYHGEVWPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWFGKAAGDSMIK 

lllllllllllhHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

mO 1 0 - 1 TNYHGEVWPQGEDYEVPVKGLYAAGECACAS VHGANRLGTNSLLDLWFGKAAGDSMI K 

370 380 390 400 410 420 

430 440 450 460 470 480 

aO 1 0 - 1 . pep F I KEQSDWKPLPANAGELTRQR I ERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEILS 

llllllilllllllllllllllllllllllllllllllllllllllllllllllllllll 
niO 1 0 - 1 FIKEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEILS 
430 440 450 460 470 480 

490 500 510 520 530 540 

aOlO - 1 . pep KGVREVMAIAERVKRTEIKDKSKVWNTARIEALELDNLIEVAKATLVSAEARKESRGAHA 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

mOl 0 - 1 KGVREVMAI AERVKRTEIKDKSKVWNTARI EALELDNLIEVAKATLVSAEARKESRGAHA 

490 500 510 520 530 540 



aOlO-1 .pep 



550 560 570 580 

SDDHPERDDENWMKHTLYHSDANTLSYKPVHTKPLSVEYIKPAKRVYX 

lllilllllllllllllllll lllllllllllllltlllllllllll 
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mOlO - 1 SDDHPERDDENWMKHTLYHSDINTLSYKPVHTKPLSVEYIKPAKRVYX 
550 560 570 580 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 73>: 

gOll . saq 

1 ATGAAGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAC CCGCCTTACC GAAGATATGA AAACCGCGAT GCGCGCCAAA 

151 GATCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAATGCCG CCGTCAAACA 

201 GTTTGAAGTA GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

251 TCCTGACCAA AATGGTCAAA CAGCGCAAAG ACGGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGACGT 

351 GCTGCACCGC TACCTGCCGC AAATGCTCTC CGCCGGCGAA ATCCGCACCG 

401 CCGTCGAAGC AGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

451 GGCAAAGTGA TGGTCGTATT GAAAAcccGC CTCGCCGGCA AAGccgATAT 

501 GGGCGAAGTC AACAAAATCT TGAAAAccGt aCTGACCGCC tga 



This corresponds to the amino acid sequence <SEQ ID 74; ORF 01 l.ng>: 

gOll . pep 

1 MKTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKTRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDGAKIYT 

101 EAGRQDLADK ENAEIDVLHR YLPQMLSAGE IRTAVEAAVA ETGAAGMADM 

151 GKVMWLKTR LAGKADMGEV NKILKTVLTA * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 75>: 

mOll.seq (partial) 

1 ATGAGGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAT CCGCCTTACC GAAGACATGA AAACCGCGAT GCGCGCCAAA 

151 GACCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAACGCCG CCGTCAAACA 

201 GTTTGAAGTG GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

251 TCCTGACCAA AATGGTCAAA CAGCGAAAAG ACAGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGAGGT 

351 ACTGCACCGC TACCTTCCCC AAATGCTTTC CGCCGGCGAA ATCCGTACCG 

401 AGGTCGAAGC TGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

4 51 GGTAAAGTCA TGGGGCTGCT GAAAACCCGC CTCGCAGGTA AAGCCGA. . . 



This corresponds to the amino acid sequence <SEQ ID 76; ORF 01 1>: 

mO 11 . pep (partial ) 

1 MRTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKIRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDSAKIYT 

101 EAGRQDLADK ENAEIEVLHR YLPQMLSAGE IRTEVEAAVA ETGAAGMADM 

151 GKVMGLLKTR LAGKA 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 01 1 shows 95.8% identity over a 165 aa overlap with a predicted ORF (ORF 01 l.ng) 
from A^. gonorrhoeae: 

mOll/gOll 



10 20 30 40 50 60 

mOll . pep MRTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKIRLTEDMKTAMRAKDQVSLGTIRL 

I'llll IIIIIIIIIIIIIIIIIIIIIMII I IIIIIIIIIIIIIMIl 

gOll MKTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKTRLTEDMKTAMRAKDQVSLGTIRL 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOl 1 . pep INAAVKQFEVDERTEADDAKITAILTKMVKQRKDSAKIYTEAGRQDLADKENAEIEVLHR 

"HIIIMIIIIMIIIIIIIMIIIIIII 1111:1 Ill till I II II 11:1 j|| 

gOll INAAVKQFEVDERTEADDAKITAILTKMVKQRKDGAKIYTEAGRQDLADKENAEIDVLHR 
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70 80 90 100 110 120 

130 140 150 160 

mOll . pep YLPQMLSAGEIRTEVET^VAETGAAGMADMGKVMGLLKTRLAGKA 
I I I I I I I I I I I I I I I I I I t I I 1 I I I M I I I I I I : I i I I I I I I I 
gOll YLPQMLSAGEIRTAVEAAVAETGAAGMADMGKVMWLKTRLAGKADMGEVNKILKTVLTA 

130 140 150 160 170 180 

gOll X 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 77>: 

g012.seq 

1 ATGCTCGCCC GTCGCTATTT TTTCAATATC CAACCCGGGG CGGTTTTCAC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGCCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC MTATCATGT TTTTTCAACa 

201 gGcggTGGAT ATTCGgcact tccgCcacca cacccaccga accgatgacc 

251 gcaaacggaG CGGAAACAAT TTTATCCGCc acacacgcca tcatatagcc 

301 gcCGCTTGCC GCGACCTTAT CGAcggcgac ggTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

401 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT 

4 51 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCGCCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CTTTTTTTTC CTGATGTTTT GTCTCTTCCT ' 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 78; ORF 012.ng>: 

9012.pep 

1 MLARRYFFNI QPGAVFTDKL LEQLMRFLQF LPEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRHFRHHTHR TDDRKRSGNN FIRHTRHHIA 

101 AACRDLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPAL LQTLFLCFGF 

201 RLFLFLFFFF LMFCLFLA* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 79>; 

m012 . seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCCACCA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAGCAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 GCCGCTCGCn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

351 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 51 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

501 nnnnnnnnnn nnnnnnnnnC AACACAAAAA GGCGTGATTT nTGCGTTTCG 

551 GCAGATTTCT CCCCACCCTC CTTCAAACGT TTTTCcTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTGT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 80; ORF 012>: 

m012.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARXXXXXXX xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx XXXQHKKA*F XRFGRFLPTL LQTFFLCFGF 

201 RLFLFLFLFF LMLCLFPA* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 81>: 

a012 . seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCTACAA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAACAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 ACCGCTCGCC GCCACCTTAT CGACGGCGAC GGTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

401 CCGCCCGGAC TTTCCAATCT AAGCAGAACC TCATCTTCAG GCTTGGCAAT 

451 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GAAGATTTCT CCCCACCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 82; ORF 012.a>: 

a012.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 

101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 

201 RLFLFLFLFF LMFCLFPA * 

moi2/aoi2 64.2% identity over a 21 8 aa overlap 

10 20 30 40 50 60 

m012 . pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 

1 1 1 1 1 1 1 1 1 1 n 1 11 1 1 1 1 1 1 1 i 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 

a012 MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
10 20 30 40 50 60 

70 80 90 100 110 120 

m012 . pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 
I I I M I I I I I I I I I I :: I I I I I f I I I I I : I I I I I I I I M I : I I : 
a012 NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
70 80 90 100 110 120 

130 140 150 160 170 180 

in012 , pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 

— • Mint 

a012 PKLRSRQTVTVNHAARTFQSKQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 

190 200 210 219 

m012 .pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I I I M I I I I I I : I I I I I I I t I I I I I I I t I I : I I I I I I 
a012 LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 

190 200 210 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF fix>m N. gonorrhoeae 

ORF 012 shows 58.7% identity over a 218 aa overlap v^th a predicted ORF (ORF 012.ng) 
fi*om K gonorrhoeae: 

in012/g012 

10 20 30 40 50 60 

in012.pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFTVRRHHIHI 

Nil :|:lll II:: II I I II I II II I I i IIIMIIIIIII Iltlllllll 

g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m012 . pep NIMFFQQAVDIRyFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 
I I I I M I I t I I I : I I I I I I I I I : I I M I : I I I I I I I I I I : I I : 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLXDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m012 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 

: : : : I I I I I I 

g012 PKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 

190 200 210 219 

mai2 . pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I I I I I I : I I I I : I I i I I I I I I I I I I : j I I ): I I I It 
g012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 83>: 

m012-l.seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCCACCA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAGCAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 GCCGCTCGCC GCCACCTTAT CGACGGCGAC GGTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CyTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

401 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT 

451 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCACCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 84; ORF 012-1>: 

m012-l.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARRHLIDGD GQRNIAFAQT XKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 
201 RLFLFLFLFF LMFCLFPA* 



m012-l/g012 91.7% identity in 218 aa overlap 



10 20 30 40 50 60 

ra012-l . pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
flil ll::|lillllllltlll I I I I i I I I I I I M I I I I I I I I I I I I i 1 I 

g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 



70 80 90 100 110 120 

ra012-l.pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 
i i I I I I I I I I I I : I I I I I I I I I : I I I I 1: I I I I I I i I I I : I I I I I 1 I I I I I I I I I i I I 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m012-l . pep XKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
I I I I I I M I I I I I I I I I I I I I I i I I I I I I I I I 1 I I I I i I I I I M I I M I I I I I I t I M t 
g012 PKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
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130 



140 



150 



160 



170 



180 



190 200 210 219 

m012-l . pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I I I I I I : I I I I I I I I t I I I I I I I I I : I I I I I I I I II 
g012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was 

a012-l . seq 

ATGCTCGCCC GTTGCCACTT 
TGACAAACTG CTTGAACAAC 
TTCTGTTTGC CCTTTTCCGT 
AAATTCGCCC GCCGTCATCA 
GGCGGTGGAT ATTCGGTACT 
GCAAACGGAG CGGAAACAAT 
ACCGCTCGCC GCCACCTTAT 
CGCGCAAACG CCTAAGCTGC 
CCGCCCGGAC TTTCCAATCT 
CAAAAGCACC GCCGTAATCT 
ACAAATCGCC GTCAAAATCC 
GAAGATTTCT CCCCACCCTC 
CGCCTTTTCC TTTTTCTTTT 
CGCTTAA 



identified in N. meningitidis <SEQ ID 85>: 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



CCTCAATATC 
TGATGCGTTT 
ATTTTCACCC 
CATCCACATC 
TCCGCTACAA 
TTTATCCGCC 
CGACGGCGAC 
GAAGCCGCCA 
AAGCAGAACC 
CATGACGCAA 
AACACAAAAA 
CTTCAAACGC 
CCTCTTTTTC 



CAATTGAGGG 
CCTCCAGTTC 
ATAAAAGTAA 
AATATCATGT 
CACCCACCGA 
ACACACGCCA 
GGTCAGCGGA 
AACCGTAACC 
TCATCTTCAG 
GGATTCTACG 
GGCGGGATTT 
TTTTTCTCTG 
CTGATGTTTT 



CGGTTCTCGC 
CTGTCGGAAT 
CCGTGCGCTT 
TTTTTCAACA 
ACCGACAATC 
TCATATAACC 
ATATTGCGTT 
GTGAACCACG 
GCTTGGCAAT 
GCGTGTGCAT 
TTGCGTTTCG 
CTTTGGCTTC 
GCCTCTTCCC 



This corresponds to the amino acid sequence <SEQ ID 86; ORF 012-1. a>: 

a012-l.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 
51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 
101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 
151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 
201 RLFLFLFLFF LMFCLFPA* 



a012-l/m012-l 



a012-l.pep 



m012-l 



97.2% identity in 218 aa overlap 

10 20 30 40 50 60 

MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
I I I M M I I I I I I I M M I I II I II I II I II I II I I II I II I I I I II II I t M II I II II 
MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
10 20 30 40 50 60 



70 80 90 100 110 120 

a012-l.pep NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
I t I I I I I 1 I I I I I I t:: I I I i I I I I I I I : I I I I I I II I M: I II I I I I II I I I II I I I [ I 
m012-l NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

a012-l . pep PKLRSRQTVTVNHAARTFQSKQNLIFRLGKQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

IIIIMIIIIIIIhlllillMllllllllMlMilllllllllllMIIII 

m012-l XKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 



190 200 210 219 

a012-l .pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I t I M I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I t 
mO 1 2 - 1 LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 87>: 

g013. seq 

1 aTgcctttga ccatgctgtg cagcaGGAcg tGCGGTTtgt tcataataca 

51 gtCcgaccGG AAAagcggAG GAAaCGCAGT GCCGCGCCCT TCCCCTTTCT 

101 TGCCGTGGCA GGCGATGCag tTgGATTCGT ACACTTTTTG CCCTTTtGtc 
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151 atgatGCTgt tgtcggCGGC AGAAGCgGCG GcgCAGAGGC AGCACAAGAT 

201 GAAGGCGGTC GGCAGTCGGG TTGTGTtcat tGgcgTTTCC cctaatgttt 

251 tgaaaccttg ttttttgatt Ttgcctttac ggggtgaaaa gtttttTtgg 

301 cccaaatccg gaatttag 

This corresponds to the amino acid sequence <SEQ ID 88; ORF 013.ng: 

gOlS.pep 

1 MPLTMLCSRT CGLFIIQSDR KSGGNAVPRP SPFLPWQAMQ LDSYTFCPHV 
51 MMLLSAAEAA AQRQHKMKAV GSR VVFIGVS PNVLKPCFLI LPLRGEKFFW 
101 PKSGI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 89>: 

m013 . seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAAGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTCATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT TTTGCCGAAT 

301 CAGTAG 

This corresponds to the amino acid sequence <SEQ ID 90; ORF 013>: 

m013.pep 

1 MPLTMLCSST CGFFMMKSER XSGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQKQPKTRAV GS RVVFIGVS FMFETLLLI L RSGXKIFLPN 
101 Q* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 91>: 

a013. seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAGGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTAATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT CTTGCCGAAT 

301 CGGTAG 

This corresponds to the amino acid sequence <SEQ ID 92; ORF 013.a>: 

a013.pep 

1 MPLTMLCSST CGFFMMKSER *SGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQRQPKTRAV GSR VVFIGVS LMFETLLLIL RSG*KIFLPN 
101 R* 

moi3/aOi3 97.0% identity over a 101 aa overlap 

10 20 30 40 50 60 

m013 . pep MPLTMLCSSTCGFEM1KSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 
IMMIIIIIitllltllillllllllllllllllMlltllllMIIIIMItllllli 
a013 MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 

10 20 30 40 50 60 

70 80 90 100 

in013 . pep AQKQPKTRAVGSRVVFIGVSFMFETLLLILRSGXKIFLPNQX 
I I : I I I I I I I I i I I I I i I I I : I I I I I I I I I I i I I I I I I I i : I 
a013 AQRQPKTRAVGSRWFIGVSLMFETLLLILRSGXKIFLPNRX 

70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF fi-om K gonorrhoeae 

ORF 013 shows 73.3% identity over a 101 aa overlap with a predicted ORF (ORF 013.ng) 
from N. gonorrhoeae: 
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m013/g013 

10 20 30 40 50 60 

m013 . pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 
Itlllill |||| IIIIIIMIIII I I I I I I t I I I I I I I I I I I I I I 

g013 MPLTMLCSRTCGLFIIQSDRKSGGNAVPRPSPFLPWQAMQLDSYTFCPFVMMLLSAAEAA 

10 20 30 40 50 60 

70 80 90 100 

m013.pep AQKQPKTRAVGSRWFIGVSF-MFETLLLILR-SGXKIFLPNQX 

I I : I ): I I I I I I I I I I I I : : : : I I I I I : I I : 
g013 AQRQHKMKAVGSRVVFIGVSPNVLKPCFLILPLRGEKFFWPKSGIX 
70 80 90 100 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ E) 93>: 

gOlS . seq 

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTTTTTTCCT ACTTTGGAAA AATCCAGAAA 

101 AGCCCTTGGT CGGCTTTTGG AAAGCACTGC CCCACCTCAA CGACACGATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAG ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGCA CAAAAATCCT GCTCCTGTTC GCCTACATCG 

251 CACTGGGCAT GGTAATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCTAT GTGTTGCATC GCCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCAT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 94; ORF 015.ng>: 

g015 .pep 

1 MQYLIVKYSH Q IFVTITILV FNIRFFLLW K NPEKPLVGFW KALPHLNDTM 

51 LLFTGLWIiMK ITHFSPFNAP WLGT KILLLF AYIALGMVMM RAR PRSTKFY 

101 TVYLLAMCCI ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 95>: 

mOlS . seq (partial) 

1 . .AAAATCAG7UV AAGCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA 

51 CGACACGATG CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT 

101 TCTCCCCGTT CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC 

151 GCCTATATCG CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC 

201 CAAGTTCTAC ACCGTTTACC TGCTCGCCAT GTGTTGCGTC GCCTGCATCG 

251 TTTACCTTGC CAAAACCAAA GTCCTGCCTT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 96; ORF 015: 
m0l5.pep (partial) 

1 . .KIRKALAGFW KALPHLNDTM LLFTGLWLMK ITHFSPFNAP WLGT KIT.T.T.L 
51 AYIALGMMMM RAR PRSTKFY TVYLLAMCCV ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 97>: 

a015 . seq 

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTGTTTTCNT ACTTTGGAAA AATCCAGAAA 

101 AGCCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA CGACACGATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC GCCTATATCG 

251 CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCCAT GTGTTGCGTC ACCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCTT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 98; ORF 015.a>: 

aOlS.pep 
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1 MQYLIVKYSH Q IEVTITILV FNIRVFXL WK NPEKPLAGFW KALPHLNDTM 
51 LLFTGLWLMK ITHFSPFNAP WLGTK ILLLL AYIALGMMMM RA RPRSTKFY 
101 TVYLLAMCCL TCIVYLAKTK VLPF* 



m0i5/aoi5 96.7% identity over a 91 aa overlap 



10 20 30 

m015 . pep KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH 

I I I I I I I I I I I I I I I I 1 I M I I I M I I I I 
a015 LIVKYSHQIFVTITILVFNIRVFXLWKNPEKPLAGFWKALPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 

40 50 60 70 80 90 

inOlS . pep FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCVACIVYLAKTKVLP 

I i I I I I I I I I I I I I I I I I I I I I i 1 I I I I I I I I I I I ! I t I I I :: I I I I I I I I I I I I 

a015 FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCLTCIVYLAKTKVLP 

70 80 90 100 110 120 



m015 . pep 



a015 



FX 
I 1 
FX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 015 shows 94.5% identity over a 91 aa overlap with a predicted ORF (ORF 015.ng) 
from K gonorrhoeae: 

m015/g015 

10 20 30 

mOlS . pep KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH 

I hlllllllllllllllllllllMIII 

g015 LIVKYSHQIFVTITILVFNIRFFLLWKNPEKPLVGFWKALPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 

40 50 60 70 80 90 

m015 . pep FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCVACIVYIiAKTKVLP 

lllllllllllllllhllllllhllllllllllllllllllllhlllllllllllll 

g015 FSPFNAPWLGTKILLLFAYIALGMVMMRARPRSTKFYTVYLIAMCCIACIVYLAKTKVLP 
70 80 90 100 110 120 



m015 .pep 
g015 



FX 



FX 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ED 99>: 

gOX8 . seq 

1 atGCAGCAGG GGCagttggt tggacgcgtc gcccgcaata AAGATATGCG 

51 GAATgctggt CTGCATggtC AGCGGATCGG CAACGGGtac gccgcgcgcg 

101 tctttgTCGA TATTGATGTT TTCCAAACCG ATATtgTCAA CGTTCGGACG 

151 GCgACCTACG GCTGCCAACA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT tCtACATTGC CGTCTGCGTC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCMT tctTCTCCGA ACACGGCTTT 

301 CGCCTCGTCT GA 



This corresponds to the amino acid sequence <SEQ ID 100; ORF 018.ng>: 

gOlB.pep 

1 MQQGQLVGRV ARNKDMRNAG LHGQRIGNGY AARVFVDIDV FQTDIVNVRT 
51 ATYGCQHIFG NKY AFFAILL PMDFYIAVCV EFDLGFSIQM QFQFFSEHGF 



wo 99/57280 



PCT/US99/09346 



200 



101 RLV* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 101>: 

mOlS.seq 

1 ATGCAGCAGA GGCAGTTGGT TGGACGCATC GCCTGCGATG AAGATATGCG 

51 GAATACTGGT CTGCATGGTC AGCGGGTCGG CAACAGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATATT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCCACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCATC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT TCTTCGCCGA ACACGGCGTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 102; ORF 01 8>: 

mOlS.pep 

1 MQQRQLVGRI ACDEDMRNTG LHGQRVGNRY AARIFFDIDI FQTDIVNVRT 
51 AAHGCQHIFG NKY AFFAILL PMDFYIAVCI EFDLGFSIQM QFQFFAEHGV 
101 RLV* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 103>: 

a018 . seq 

1 ATGCAGCAGG GGCAGTTGGT TGGACGCGTC GCCCGCAATA AAGATATGCG 

51 GAATACTGGT CTGCATAGTC AGCGGATCGG CAACGGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATGTT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCTACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCGTC GAGTTTGGCC 

251 TCGGTTTTAG CATCCAAATG CAGTTTCAAT TCTTCACCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 104; ORF 018.a>: 

aOlS.pep 

1 MQQGQLVGRV ARNKDMRNTG LHSQRIGNGY AARIFFDIDV FQTDIVNVRT 
51 AAYGCQHIFG NK YAFFAILL PMDFYIAVCV EFGLGFSIQM QFQFFTEHGF 
101 RLV* 

moi8/aoi8 86.4% identity over a 103 aa overlap 

10 20 30 40 50 60 

itiOlB . pep MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAARIFFDIDIFQTDIVNVRTAAHGCQHIFG 
Ml 11111:1 I I I I I I I I I I : I I M I t I I I I I I : I I I I I { | 

a 0 1 8 MQQGQLVGRVARNKDMRNTGLHSQRI GNGYAT^ X FFD I D VFQT D I VN VRT AAYGCQH I FG 

10 20 30 40 50 60 

70 80 90 100 

m018 . pep NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
I I M I I i I I I I I I i I I I I 1 : I I I I I I I I I 1 I I M : I I I I I I I 
a018 NKYAFFAILLPMDFYIAVCVEFGLGFSIQMQFQFFTEHGFRLVX 
70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 018 shows 84.5% identity over a 103 aa overlap with a predicted ORF (ORF OlS.ng) 
from A'', gonorrhoeae: 

m018/g018 

10 20 30 40 50 60 

mOlS.pep MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAARIFFDIDIFQTDIVNVRTAAHGCQHIFG 

Ml ::|lll:||||||: |:| I I I : I I I I I | I I M I : : I f I I I I I 

g018 MQQGQLVGRVARNKDMRNAGLHGQRIGNGYAARVFVDIDVFQTDIVNVRTATYGCQHIFG 

10 20 30 40 50 60 
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70 80 90 100 

m018 . pep NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
I I t I I I I I I I I I I I t I I I I : I I I I j I 1 I I I I I I I I : I I I I I I I 
gOlS NKYAFFAILLPMDFYIAVCVEFDLGFSIQMQFQFFSEHGFRLVX 
70 80 90 100 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 105>: 

g019.seq (partial) 

1 . .ctgctggcgg ccctggtgct tgccgcgtgt tcttcgACAA ACAcacTGCC 
51 AGCCGGCAAG ACCCCGGCAG ACAATATAGA AActgcCgAC CTTTCGGCAA 
101 GCGTTCCCAC CCgcCCTGCC GAACCGGAAG GAAAAACGCT GGCAGATTAC 
151 GGCGGCTACC CGTCCGCACT GGATGCAGTG AAACAGAACA ACGATGCGGC 
201 AGCCGCCGCC TATTTGGAAA Acgcaggaga cagCGcgatg gcGGAAAatg 
251 tccgcaagga gtgGCTGa 

This corresponds to the amino acid sequence <SEQ ID 106; ORF 019.ng>; 

g019.pep (partial) 

1 . .LLAALVLAAC SSTNTLPAGK TPADNIETAD LSASVPTRPA EPEGKTLADY 
51 GGYPSALDAV KQNNDAAAAA YLENAGDSAM AENVRKEWL* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 107>: 

m019.seq (partial) 

1 ATGTACCTAC CCTCTATGAA GCATTCCCTG CCGCTGCTGG CGGCCCTGGT 

51 GCTTGCCGCG TGTTCTTCGA CAAACACACT GCCAGCCGGC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCCGCCCT 

151 GCCGAACCCG AAAGAAAAAC GCTGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCACGCAG ACAGTGGACG CTGTTTGCAC AGGAATACGC 

351 CAAACTCGAA CCGGCAGGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

401 CGAGCCGCAA CGACTATACG CGTGCCGCTG AACTGGTCAA AAATACGGGC 

451 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCGG 

551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

601 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 

751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 CGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 

851 ACGCCCGCGC CGCCTTGCGC GCCCGACGTT GGGACGAGCT GGCCTCCGTT 

901 ATCTCGCATA TGCCCGAAAA ACTGCAAAAA AGCCCGACCT GGCTCTACTG 

951 GCTGGCACGC AGCCGCGCCG CAACGGGCAA CACGCAAGAG GCGGAAAAAC 

1001 TTTACAAACA GGCGGCAGCG ACGGGCAGGA ATTTTTATGC GGTGCTGGCA 

1051 GGGGAAGAAT TGGGTCGGAA AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAAAC AGCGTCCGCC GCATGGCGGA AGACGGTGCA GTCAAACGCG 

1151 CACTGGTACT GTTCCAAAAC AGCCAATCTG CCGGTGATGC AAAAATGCGC 

1201 CGTCAGGCTC AGGCGGAATG GCGTTTTGCC ACACGCGGCT TTGACGAAGA 

1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

1301 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 

1351 CGCTATATTT CGCCGTTTAA AGACACGGTA ATCCGCCACG CGCAAAATGT 

1401 TAATGTCGAT CCGGCTTGGG TTTATGGGCT GATTCGTCAG GAAAGCCGCT 

14 51 TCGTTATAGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 

1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 

1551 ACAACTTTAC ACCGCCGACG GG. . . 

This corresponds to the amino acid sequence <SEQ ID 108; ORF 01 9>: 

m019.pep (partial) 

1 MYLPSMKHSL PLLAALVLAA CSSTNT LPAG KTPADNIETA DLSASVPTRP 
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51 AEPERKTLAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LFAQEYAKLE PAGRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEMESGLSL EQRSFAWGVL 

251 GHYQSQNLNV PAALDYYGKV ADRRQLTDDQ lEWYARAALR ARRWDELASV 

301 ISHMPEKLQK SPTWLYWLAR SRAATGNTQE AEKLYKQAAA TGRNFYAVLiA 

351 GEELGRKIDT RNNVPDAGKN SVRRMAEDGA VKRALVLFQN SQSAGDAKMR 

401 RQAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 

451 RYISPFKDTV IRHAQNVNVD PAWVYGLIRQ ESRFVIGAQS RVGAQGLMQV 

501 MPATAREIAG KIGMDAAQLY TADG. . . 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 109>: 

a019. seq 

1 ATGTACCCAC CCTCTCTGAA GCATTCCCTG CCGCTGCTGG TGGNCCTGGT 

51 GCTTGCCGCG TGTTCTTNGA CAAACACACT GTCAGCCGAC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCNGCCCT 

151 GCCGAACCCG AANGAAAAAC GTNGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCGCGCAG ACAGTGGACG CTGTNTGCAC ANGAATATGC 

351 N/^CTCGAA CCGGCANGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

401 CGAGCCGCAA CGACTATACG CGTGCCGCCG AACTGGTCAA AAATACGGGC 

451 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 

551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

601 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 

751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 NGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 

651 ACGCCCGCGC CGCNNTNNGC NNNCGNNGTT NGNANGANNT GGCNNCCGNN 

901 ANCNCGNNKN TGCNNGANAA ACNNNNNNAN AGNCNNANNT NGNTNNANTG 

951 NNTGGCACGC AGCCGCGCCG CNACGGGCAA CACGCAANAN GCGGMAAAC 

1001 TNTACAAACA GGCGGCAGCA NCGGGCANGA ATTTTTATGC NGTGCTGNCN 

1051 GGGGAAGAGT TGGGGCGCAN AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAANC AGCGTCCTCC GTATGGCGGA AGACGGCGCG ATTAAGCGCG 

1151 CGCTGGTGCT GTTCCGAAAC AGCCGAACCG CCGGCGATGC GAAAATGCGC 

1201 CGTCNGGCTC AGGCGGAATG GCGTTTCGCC ACACGCGGCT TCGATGAAGA 

1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

1301 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 

1351 CGCTACATTT CGNNNNNTNA NGACACGGTA ATCCGCCACG CGCAAAATGT 

1401 TAATGTCGAT CCGGCGTGGG TTTACGGGCT GATTCGTCAG GAAAGCCGCT 

1451 TCGTTATGGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 

1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 

1551 ACAACTTTAC ACCGCCGACG GCAATATCCG TATGGGGACG TGGTATATGG 

1601 CGGACACCAA ACGCCGCCTG CAAAACAACG AAGTCCTCGC CACCGCAGGC 

1651 TATAACGCCG GTCCCGGCAG GGCGCGCCGA TGGCAGGCGG ACACGCGGCT 

1701 CGAAGGCGCG GTATATGCCG AAACCATCCC GTTTTCCGAA ACGCGCGACT 

1751 ATGTCAAAAA AGTGATGGCC AATGCCGCCT ACTACGCCTC CCTCTTCGGC 

1801 GCGCCGCACA TCCCGCTCAA ACAGCGTATG GGCATTGTCC CCGCCCGCTG 

1851 A 

This corresponds to the amino acid sequence <SEQ ID 110; ORF 019.a>: 

a019.pep 

1 MYPPSLKHSL PLLVXLVLAA CSXTNT LSAD KTPADNIETA DLSASVPTXP 

51 AEPEXKTXAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LXAXEYAKLE PAXRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEMESGLSL EQRSFAWGVL 

251 GHYQSQNLNV PAALDYXGKV ADRRQLTDDQ lEWYARAAXX XRXXXXXAXX 

301 XXXXXXKXXX XXXXXXXXAR SRAATGNTQX AXKLYKQAAA XGXNFYAVLX 

351 GEELGRXIDT RNNVPDAGKX SVLRMAEDGA IKRALVLFRN SRTAGDAKMR 

4 01 RXAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
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451 RYISXXXDTV IRHAQNVNVD PAWVYGLIRQ ESREVMGAQS RVGAQGLMQV 

501 MPATAREIAG KIGMDAAQLY TADGNIRMGT WYMADTKRRL QNNEVLATAG 

551 YNAGPGRARR WQADTPLEGA VYAETIPFSE TRDYVKKVMA NAAYYASLFG 

601 APHIPLKQRM GIVPAR* 



moi9/aoi9 88.9% identity over a 524 aa overlap 



10 20 30 40 50 60 

m019.pep MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPERKTLAD 
II 11:1111111: II I I I I I II I I i I II II I I I I II I I I II i i I I I I I II II 
a019 MYPPSLKHSLPLLVXLVLAACSXTNTLSADKTPADNIETADLSASVPTXPAEPEXKTXAD 

10 20 30 40 50 60 

70 80 90 100 110 120 

m019.pep YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 

I I I I II I I I I I I I I I I II I I I t I II I I I t I I I I I I I II I M M I I M I I I I I II I I I I 
a 0 1 9 YGG YPS ALDAVKQKN DAAVAAYLENAGDS AMAEN VRNEWLKS LGARRQWT LXAXE YAKLE 

70 80 90 100 110 120 

130 140 150 160 170 180 

mOl 9 . pep PAGRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 

II M I I I I I I I II I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II t I II 
a019 PAXRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 

130 140 150 160 170 180 

190 200 210 220 230 240 

mOl 9 . pep LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 
I I I I I I I M I I I i I I I I I I I I I I I I I I i I I II I I II I I t I I I I I I I I I I I i I I I I I I I I I 
a019 LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 
190 200 210 220 230 240 

250 260 270 280 290 300 

mOl 9 . pep EQRSFAWGVLGHYQSQNLNVPAALDYYGKVADRRQLTDDQIEWYARAALRARRWDELASV 

I I I I I I I I I I I I I I II I I I I I I ! I i I I I I I I M i I I I I I I I 1 I I I I I I I 
a 0 1 9 EQRS FAWGVLGHYQSQNLN V PAALD YXGKVADRRQLT DDQIEW YARAAXXXRXXXXXAXX 

250 260 270 280 290 300 

310 320 330 340 350 360 

mOl 9 . pep ISHMPEKLQKSPTWLYWLARSRAATGNTQEAEKLYKQAAATGRNFYAVLAGEELGRKIDT 
I : IIIIIIIMII I 11111111:1 llllll lltlll III 

a019 XXXXXXKXXXXXXXXXXXARSRAATGNTQXAXKLYKQAAAXGXNFYAVLXGEELGRXIDT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m019.pep RNNVPDAGKNSVRRMAEDGAVKRALVLFQNSQSAGDAKMRRQAQAEWRFATRGFDEDKLL 
I I M I I I I I II I I I I I I I: I I I I II I: II: : I M i I I I I I I I I I I I I I I I I I I I jj | 
a019 RNNVPDAGKXSVLRMAEDGAIKRALVLFRNSRTAGDAKMRRXAQAEWRFATRGFDEDKLL 

370 380 390 400 410 420 

430 440 450 460 470 480 

m019.pep TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYISPFKDTVIRHAQNVNVDPAWVYGLIRQ 
i I I I I I I It I I I I I I I I t I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II 
a019 TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYISXXXDTVIRHAQNVNVDPAWVYGLIRQ 

430 440 450 460 470 480 



490 500 510 520 

raO 1 9 . pep ESRFVIGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADG 

ltll|:|llltlMllllllllillllilllllllllll 

a019 ESRFVMGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADGNIRMGTWYMADTKRRL 

490 500 510 520 530 540 



a019 



QNNEVLATAGYNAGPGRARRWQADTPLEGAVYAETIPFSETRDYVKKVMANAAYYASLFG 
550 560 570 580 590 600 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 019 shows 95.5% identity over a 89 aa overlap with a predicted ORF (ORF 019.ng) 
from N, gonorrhoeae: 

g019/ni019 

10 20 30 40 49 

g019 . pep LLAALVIiAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPEGKTLAD 

llllllllilllllMIIIIIIIIIIIIIIIIIIIIIIIIIII IIIM 

m019 MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPERKTLAD 
10 20 30 40 50 60 



50 60 70 80 89 

g 0 1 9 . pep YGGYPSALDAVKQNNDAAAAAYLENAGDSAMAENVRKEWL 

llllllllllllhlllhllllllllllllllllhlll 

mO 1 9 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQE YAKLE 

70 80 90 100 110 120 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 1 1 1>: 

g023 . seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 AATGCAGCGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCCCTG CCTAAAGAAT ATCCGGCATG GCAGGCATTT 

151 TTTAGTCAAG CTTGGGTAAA AGTATTTACC CAAGTGAGCT TTATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT TGtctGGCTG 

301 GTCGGCTGCC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 023.ng>: 

g023.pep 

1 MVERKLTGAH YGLRDWVMQR ATAVIMLIYT VALLWLFAL P KEYPAWOAF 
51 FSQAWVKVPT QVSFIAVFLH AWVGIRDLWM DYIKPFGV RL FLQVATIVWL 
VGCLVYSVKV IWG* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 1 13>: 

m023.seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 GATGCAACGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTTCCCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTTTGGCTG 

301 GTCGGCTGTC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 023>: 

m023 .pep 

1 MVERKLTGAH YGLRDWVMQR ATAV I MLIYT VALLWLFSL P KEYSAWOAF 

51 FSQTWVKVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGV RL FLQVATIVWL 

101 VGCLVY SVKV IWG* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 1 15>: 

a023.seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GGGATTGGGC 

51 GATGCAACGT GCGACCGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCTCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 
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205 



201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATNA 
251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTCTGGCTG 
301 GTCGGCTGCT TGGTGTATTC AATTAAAGTA ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 023. a>: 

a023.pep 

1 MVERKLTGAH YGLRDWAMQR ATA V IMLIYT VALLWLFAL P KEYSAWQAF 

51 FSQTWVKV FT QVSFIAVFLH AWVGI RDLWM DYXKPFGVR L FLQVATIVWL 

101 VGCLVYSIKV IWG* 

m023/a023 96.5% identity over a 1 1 3 aa overlap 

10 20 30 40 50 60 

m023 . pep MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLWLFSLPKEYSAWQAFFSQTWVKVFT 
I I I I I I I I I I I I I I t I : I I tn I M I I I I I I I I I I I I I : I I I I I I I M I t I I I I I I t I I I 
a023 MVERKLTGAHYGLRDWAMQRATAVIMLIYTVALLWLFALPECEYSAWQAFFSQTWVKVFT 
10 20 30 40 50 60 

70 80 90 100 110 

m023.pep QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 
I I I i I I I I I I t I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I t I I : I M I I I 
a023 QVSFIAVFLHAWVGIRDLWMDYXKPFGVRLFLQVATIVWLVGCLVYSIKVIWGX 

70 80 90 100 110 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 023 shows 97.3% identity over a 1 13 aa overlap with a predicted ORF (ORF 023 .ng) 
from N. gonorrhoeae: 

g023/m023 

10 20 30 40 50 60 

g023 . pep MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLVVLFALPKEYPAWQAFFSQAWVKVFT 

lllllllllllllllilMIMIIIIIIIIIIMIIII:|llll lllllllhllllll 

m023 MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLVVLFSLPKEYSAWQAFFSQTWVKVFT 
10 20 30 40 50 60 



70 80 90 100 110 

g023 . pep QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

m023 QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 
70 80 90 100 110 



The following partial DNA sequence 



g025 . seq 






1 


ATGTTGAAAC 


AAAcgACACT 


51 


GTTGGGCGGT 


TGcgCCACCC 


101 


CAGGTATGCA 


GACCGTATCG 


151 


ACGCCGTACA 


ATGCCGCTCC 


201 


CGTGCAAact 


gcgccggttT 


251 


CACCTGCCGT 


TTCGGgtaca 


301 


aacgCGGCGa 


cgCataCTAT 


351 


caaAcgCtac 


CATATCTCTC 


401 


CCGACAATAC 


GTTGAGCATC 


451 


TATGCCGCAC 


CGAAAACCGC 


501 


TGCCGCGCAA 


ACCCCTGTGA 


551 


CGCCGCAACC 


TGCCGCGCCC 


601 


GCGCCCGCCC 


CGCAATCTCC 


651 


CGGCGGCATT 


GTTTGGCAGC 



identified in K gonorrhoeae <SEQ ID 1 17>: 

TTTGGCAGCT TGTACCGCCG TTGCCGCTCT 
AACAGCCTGC TccTGTCATT GCAGGCAATT 
TCTGCGCCGG TTTACAATCC TTATGGCGCA 
TGCCGCCAac gatgcGCCgT ATGTGCCGCC 
ATTCGCCTCC TGCTTATGTT CCGCcgtCTG 
tatgtTCCTT CTTACGCACC CgtcgACATC 
TGTGCGTGGC GACACgGtgt acaACATTTc 
AAGACGATTT CCGTGCGTGG AACGGCATGA 
GGTCAGATTG TTAAAGTCAA ACCGGCaggA 
AGCCGTAGAA AGCAGGCCCG CCGTACCGGC 
AACCCGCCGC gcaACCGCCC GTTCAGTCCG 
GCTGCGGAAA ATAAAGCGGT TCCCGCCCCC 
TGCCGCTTCG CCTTCCGGCA CGCGTTCGGT 
GTCCGACCCA AGGTAAAGTG GTTGCCGATT 
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701 TCGGCGGCGG CAACAAGGGT GTCGATATTG CCGGCAATGC CGGACAACCC 

751 GTTTTGGCGG CGGCTGACGG CAAAGTGGTT TATGCCGGTT CAGGTTTGAG 

801 GGGATACGGA AACTTGGTCA TCATCCAGCA CAATTCCTCT TTCCTGACCG 

851 CGTACGGGCA CAACCAAAAA TTGCTGGTCG GCGAAGGTCA GCAGGTCAAA 

901 CGCGGTCAGC AGGTTGCTTT GATGGGTAAT ACCGATGCTT CCAGAACGCA 

951 GCTTCATTTC GAGGTGCGTC AAAACGGCAA ACCGGTTAAC CCGAACAGCT 

1001 ATATCGCGTT CTGA 



This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 025.ng>: 

g025 .pep 

1 MLKQTTLLAA CTAVAALLGG CATQQPAPVI AGNSGMQTVS SAPVYNPYGA 

51 TPYNAAPAAN DAPYVPPVQT APVYSPPAYV PPSAPAVSGT YVPSYAPVDI 

101 NAATHTIVRG DTVYNISKRY HISQDDFRAW NGMTDNTLSI GQIVKVKPAG 

151 YAAPKTAAVE SRPAVPAAAQ TPVKPAAQPP VQSAPQPAAP AAENKAVPAP 

201 APAPQSPAAS PSGTRSVGGI VWQRPTQGKV VADFGGGNKG VDIAGNAGQP 

251 VLAAADGKW YAGSGLRGYG NLVIIQHNSS FLTAYGHNQK LLVGEGQQVK' 

301 RGQQVALMGN TDASRTQLHF EVRQNGKPVN PNSYIAF* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 119>: 



m025 . seq 


(partial) 










1 


. .GTGCCGCCGG 


TGCAAAGCGC 


GCCGGTTTAT 


ACGCCTCCTG 


CTTATGTTCC 


51 


GCCGTCTGCA 


CCTGCCGTTT 


CGGGTACATA 


CGTTCCTTCT 


TACGCACCCG 


101 


TCGACATCAA 


CGCGGCGACG 


CATACTATTG 


TGCGCGGCGA 


CACGGTGTAC 


151 


AACATTTCCA 


AACGCTACCA 


TATCTCTCAA 


GACGATTTCC 


GTGCGTGGAA 


201 


CGGCATGACC 


GACAATACGT 


TGAGCATCGG 


TCAGATTGTT 


AAAGTCAAAC 


251 


CGGCAGGATA 


TGCCGCACCG 


AAAGCCGCAG 


CCGTAAAAAG 


CAGGCCCGCC 


301 


GTACCGGCTG 


CCGCGCAACC 


GCCCGTACAG 


TCCGCACCCG 


TCGACATTAA 


351 


CGCGGCGACG 


CATACTATTG 


TGCGCGGCGA 


CACGGTGTAC 


AACATTTCCA 


401 


AACGCTACCA 


TATCTCTCAA 


GACGATTTCC 


GTGCGTGGAA 


CGGCATGACC 


451 


GACAATATGT 


TGAGCATCGG 


TCAGATTGTT 


AAAGTCAAAC 


CGGCAGGATA 


501 


TGCCGCACCG 


AAAACCGCAG 


CCGTAGAAAG 


CAGGCCCGCC 


GTACCGGCTG 


551 


CCGTGCAAAC 


CCCTGTGAAA 


CCCGCCGCGC 


AACCGCCTGT 


GCAGTCCGCG 


601 


CCGCAACCTG 


CCGCGCCCGC 


TGCGGAAAAT 


AAAGCGGTTC 


CCGCGCCCGC 


651 


CCCGCAATCT 


CCTGCCGCTT 


CGCCTTCCGG 


CACGCGTTCG 


GTCGGCGGCA 


701 


TTGTTTGGCA 


GCGTCCGACG 


CAAGGTAAAG 


TGGTTGCCGA 


TTTCGGCGGC 


751 


AACAACAAGG 


GTGTCGATAT 


TGCCGGTAAT 


GCGGGACAGC 


CCGTTTTGGC 


801 


GGCGGCTGAC 


GGCAAAGTGG 


TTTATGCCGG 


TTCAGGTTTG 


AGGGGATACG 


851 


GAAACTTGGT 


CATCATCCAG 


CATAATTCTT 


CTTTCCTGAC 


CGCATACGGG 


901 


CACAACCAAA 


AATTGCTGGT 


CGGCGAGGGG 


CAGCAGGTCA 


AACGCGGTCA 


951 


GCAGGTTGCT 


TTGATGGGCA 


ATACCGATGC 


TTCCAGAACG 


CAGCTTCATT 


1001 


TCGAGGTGCG 


TCAAAACGGC 


AAACCGGTTA 


ACCCGAACAG 


CTATATCGCG 


1051 


TTCTGA 











This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 025>: 

m025.pep (partial) 

1 . .VPPVQSAPVY TPPAYVPPSA PAVSGTYVPS YAPVDINAAT HTIVRGDTVY 

51 NISKRYHISQ DDFRAWNGMT DNTLSIGQIV KVKPAGYAAP KAAAVKSRPA 

101 VPAAAQPPVQ SAPVDINAAT HTIVRGDTVY NISKRYHISQ DDFRAWNGMT 

151 DNMLSIGQIV KVKPAGYAAP KTAAVESRPA VPAAVQTPVK PAAQPPVQSA 

201 PQPAAPAAEN KAVPAPAPQS PAASPSGTRS VGGIVWQRPT QGKWADFGG 

251 NNKGVDIAGN AGQPVLAAAD GKWYAGSGL RGYGNLVIIQ HNSSFLTAYG 

3 01 HNQKLLVGEG QQVKRGQQVA LMGNTDASRT QLHFEVRQNG KPVNPNSYIA 

351 F* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 

a025.seq 

1 ATGTTGACAC CAACAACACT TTAGGTAGCT TGTACCGCCC TTGCCGCTCA 

51 GTTGGGCGGA TGCCGCACCG AACACCCTTC TCCTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTACCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 
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151 


ACGCCGTACA 


201 


GGTGCAAAGC 


251 


CACCTGCCGT 


301 


AACGCGGCGA 


351 


CAAATGCTAC 


401 


CCGACAATAC 


451 


TATGCCGCAC 


501 


TGCCGCGCAA 


551 


CGCATACTAT 


601 


CATATCTCTC 


651 


GTTGAGCATC 


701 


CGAMGCCGC 


751 


ACCCCTGTGA 


801 


TGCCGCGCCC 


851 


CTCCTGCCGC 


901 


CAGCGTCCGA 


951 


GGGTGTCGAT 


1001 


ACGGCAAAGT 


1051 


GTCATCATCC 


1101 


AAAATTGCTG 


1151 


CTTTGATGGG 


1201 


CGGCAAAACG 



ATGCCGCTCC 
GCGCCGGTTT 
TTCGGGTACA 
CGCATACTAT 
CATATCTCTC 
GTTGAGCATC 
CGAAAGCCGC 
CCGCTCGTAC 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AACCCGCCGC 
GCTGCGGAAA 
TTCGCCTTCC 
CGCAAGGTAA 
ATTGCAGGAA 
GGTTTATGCA 
AGCATAATTC 
GTCGGCGAAG 
CAATACCGAG 
GCAAACCGGT 



TGCCGCCAAC 
ATANGCCTCC 
TACGTTCCTT 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AGTCCGCACC 
GACACGGTGT 
CCGTGCGTGG 
TTAAAGTCAA 
AGCAGGCCCG 
GCAACCGCCT 
ATAAAGCGGT 
GGCACGCGTT 
AGTGGTTGCC 
ATGCGGGACA 
GGTTCCGGTT 
TTCCTTCCTG 
GCCAGCAGGT 
GCTTCTAGAA 
TAATCCGAAC 



GATGCGCCGT 
TGCTTATGTT 
CTTACGCANC 
GACACGGTGT 
CCGTGCGTGG 
TTAAAGTCAA 
AGCAGGCCCG 
CGTCGACATC 
ACAACATTTC 
AACGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
GTGCAGTCCG 
TCCCGCGCCC 
CGGTCGGCGG 
GATTTCGGCG 
GCCCGTTTTG 
TGAGGGGATA 
ACCGCATACG 
CAAACGCGGG 
CGCAGCTTCA 
AGCTATATCG 



ATGTGCCGCC 
CCGCCGTCTG 
CGTCGACATC 
ACAAGATTTC 
7WVCGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
AACGCGGCGA 
CAAACGCTAC 
CCGACAATAC 
TATGCCGCAC 
TGCCGTGCAA 
CGCCGCAACC 
GCCCCGCAAT 
CATTGTTTGG 
GCAACAACAA 
GCGGCGGCTG 
CGGCAATTTG 
GGCACAACCA 
CAGCAGGTCG 
TTTCGAGGTG 
CGTTCTGA 



This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 025.a> 

a025 .pep 

MLTPTTL*VA CTALAAQLG G CPTQHPSPVI AGNSGMQTVP 
TPYNAAPAAN DAPYVPPVQS 
DTVYKISKCY 
SRPAVPAAAQ 
NGMTDNTLSI 



1 
51 
101 
151 
201 
251 
301 
351 
401 



NAATHTIVRG 
YAAPKTVAAVK 
HISQDDFRAW 
TPVKPAAQPP 



APVYXPPAYV PPSAPAVSGT 
HISQDDFRAW NGMTDNTLSI 
PLVQSAPVDI NAATHTIVRG 
GQIVKVKPAG YAAPKAAAVK 



VQSAPQPAAP AAENKAVPAP APQSPAASPS 
QRPTQGKWA DFGGNNKGVD lAGNAGQPVL AAADGKWYA 
VIIQHNSSFL TAYGHNQKLL VGEGQQVKRG QQVALMGNTE 
RQNGKPVNPN SYIAF* 



SAPVYNPYGA 
YVPSYAXVDI 
GQIVKVKPAG 
DTVYNISKRY 
SRPAVPAAVQ 
GTRSVGGIVW 
GSGLRGYGNL 
ASRTQLHFEV 



m025/a025 97.4% identity over a 351 aa overlap 

10 20 30 

m025 . pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

I I I I I I I I I I : I I I I I I I I i I I I I I I I I I i 
a025 GMQTVPSAPVYNPYGATPYNAAPAANDAPYVPPVQSAPVYXPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 

40 50 60 70 80 90 

m025.pep YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
II I M I I I I I I I I I II I I I: I I I II II II I II I I I II II II II I I II I I I II I I I I I I 
a025 YAXVDINAATHTIVRGDTVYKISKCYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
100 110 120 130 140 150 



100 110 120 130 140 150 

ra025 . pep KAAAVKSRPAVPAAAQPPVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I II II I I I 
a025 KAAAVKSRPAVPAAAQPLVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 
160 170 180 190 200 210 

160 170 180 190 200 210 

m025 . pep DNMLSIGQIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 
I i I I I I I I I I I I I I I I I I I I: II I : I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
a025 DNTLSIGQIVKVKPAGYAAPKAAAVKSRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 
220 230 240 250 260 270 



m025.pep 



220 230 240 250 260 270 

KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAAAD 
M I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I M II I I I M I I I I I I I I I I I I 
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a025 KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAAAD 
280 290 300 310 320 330 

280 290 300 310 320 330 

m025 . pep GKVVYAGSGLRGYGNLVIIQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDASRT 
I N I I I I I I I I I I I I I I 1 I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I : I I 1 I 
a025 GKWYAGSGLRGYGNLVIIQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTEASRT 
340 350 360 370 380 390 

340 350 
Itl025 .pep QLHFEVRQNGKPVNPNSYIAFX 
I I I I I I I M I M I M I I I I I I 1 
a025 QLHFEVRQNGKPVNPNSYIAFX 
400 410 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 025 shows 75.6% identity over a 353 aa overlap with a predicted ORF (ORF 025.ng) 
from A^, gonorrhoeae: 

m025/g025 

10 20 30 

m02 5 . pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

llllhlllhillllllllllllllllll 

g02 5 GMQTVSSAPVYNPYGATPYNAAPAANDAPYVPPVQTAPVYS PPAYVPPSAPAVSGTYVPS 

40 50 60 70 80 90 

40 50 60 70 80 90 

m02 5 . pep YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIilllllllllllllMIIIIII 

g025 YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
100 110 120 130 140 150 



100 110 120 130 140 150 

m025 . pep KAAAVKSRPAVPAAAQPPVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 
I 

g025 K 



160 170 180 190 200 210 

m025 . pep DNMLSIGQIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

llllllllllllhlllllllllllllllllllllllll 

g025 TAAVESRPAVPAAAQTPVKPAAQPPVQSAPQPAAPAAEN 

160 170 180 190 

220 230 240 250 260 

m025 .pep KAVPAPAP--QSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAA 

lillllll IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMhlllllMMIIIIIIII 

g025 KAVPAPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGGNKGVDIAGNAGQPVTiAA 
200 210 220 230 240 250 



270 280 290 300 310 320 

raO 2 5 . pep ADGKWYAGSGLRGYGNLVI I QHNSS FLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDAS 

lllllllllllllllillllllllMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

g025 ADGKWYAGSGLRGYGNLVI IQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDAS 

260 270 280 290 300 310 



330 340 350 

m025 . pep RTQLHFEVRQNGKPVNPNSYIAFX 
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Illlllllllllllllllllllll 

g 0 2 5 RTQLHFEVRQNGKPVNPNSYIAPX 
320 330 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ED 1 13>: 

g031 . seq 

1 ATGGTGTCCC TCCGCTTCAG ATTCGGCAAC CACTTTAAAC GCCGACATTC 

51 TGACAATTTC CTTTTCCGCC AGCCAAATAT CATGCGTATC TTTCGGTTCG 

101 GGCTTGTTGG GCATGGCAAC CTTCAACAGC CGCGCCATCA CAGGAATCGT 

151 CGTTCCCTGA ATCAGCAGCG ACAGCACCAC CACGGCAAAC GCCACATCAA 

201 ACAGCAGGTG CGAATTGGGA ACGCCCATCA CCAGCGGCAT CATCGCCAGC 

251 GAAATCGGTA CGGCTCCTCG CAAGCCCAAC CAACTGATAT ACGCCTTTTC 

301 ACGCAGGCTG TAATTGAATT TCCACAAACC GCCGAACACT GCCAGCGGAC 

351 GCGCGACCAG CATCAGGAAC GCCGCAATCG CCAAGGCTTC CGCCGCCCTG 

401 TCCAACACGC CGGCGGGAGA AACCAGCAGA CCGAGCATGA CGAACAAAGT 

451 TGCCTGCGCC AGCCAAGCCA AACCGTCCAT CACACGCAAA ACGTGTTCCG 

501 TcgcACGGTT GCGCTGGTTA CCGACAATGA TGCCGGCAAG GTAAACCGCC 

551 AAAAAGCCGC TGCCGCCTAT GGTATTGGTA AACGCAAACA CAAGCAGCCC 

601 GCCCGACACA ATCATCAGCG CGTACAGACC TTCCGtacac acctccaatt 

651 cccaatcaac gtcatagctg tctcccgtgt taaaatgttc ttcacttcag 

701 aatccccccc ttcttcccag cccgaaacct tcatgtgtta naccctgggg 

751 tgccccaacg gatttagtaa cctcccaatg actctgcttg tcgccccctt 

801 cgcccgcttt ctccttccgg gaaaacttgt tgtccccgtc ttacattaa 

This corresponds to the amino acid sequence <SEQ ID 114; ORF 03 1 .ng>: 

gOai.pop 

1 MVSLRFRFGN HFKRRHSDNF LFRQPNIMRI FRFGLVGHGN LQQPRHHRNR 

51 RSLNQQRQHH HGKRHIKQQV RIGNAHHQRH HRQRNRYGSS QAQPTDIRLF 

101 TQAVIEFPQT AEHCQRTRDQ HQERRNRQGF RRPVQHAGGR NQQTEHDEQS 

151 CLRQPSQTVH HTQNVFRRTV ALVTDNDAGK VNRQKAAAAY GIGKRKHKQP 

201 ARHNHQRVQT FRTHLQFPIN VIAVSRVKMF FTSESPPSSQ PETFMCXTLG 

251 CPNGFSNLPM TLLVAPFARF LLPGKLW PV LH* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 1 15>: 

m031.seq (par'tial) 

1 ... CGCCTGT^GC ACGGTGTCGG ACTGCATTTC TATTCGGCTA TACGCCTTTT 
51 CACGCAGGCT GTAATTGAAT TTCCACAAAC CGCCGAACAC TGCCGACGGA 

101 CGCGCGACCA GCATCAGGAA CGCCGCAATC GCCAAgGCTT CCGCCGCCCT 

151 GTCCAACACG TTGGCAGGAG AAACCAGCAG CAAAGGCATT CCCAAACGTG 

201 CGGACAAAGT GGTCGAAACC ACGCTCAGAA ACAACAGTGC GCCACCCGGC 

251 AG 

This corresponds to the amino acid sequence <SEQ ED 1 16; ORF 03 1>: 

m031.pep <partial) 

1 . . . RLKHGVGLHF YSAIRLFTQA VIEFPQTAEH CRRTRDQHQE RRNRQGFRRP 

51 VQHVGRRNQQ QRHSQTCGQS GRNHAQKQQC ATRQ 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

a031 . seq 

1 ATACGCCTTT TCACGCAGGC TGTAATTGAA TTTCCACAAA CCGCCGAACA 

51 CTGCCGGCGG ACGCGCGACC AGCATCAGGA ACGCCGCAAT CGCCAAGGCT 

101 TCCGCCGCCC CGTCCAACAC GTTGGCAGGA GAAACCAGCA GCAAAGGCAT 

151 TCCCAAACGT GCGGACAAAG TGGTCGAAAC CACGCTCAGA AACAACAGTG 

201 CGCCACCCGG CAG 

This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 031.a>; 

a031.pep (partial) 

1 IRLFTQAVIE FPQTAEHCRR TRDQHQERRN RQGFRRPVQH VGRRNQQQRH 

51 SQTCGQSGRN HAQKQQCATR Q 
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m03i/a03i 100.0% identity over a 71 aa overlap 

10 20 30 40 50 60 

in031 . pep RLKHGVGLHFYSAIRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ 

I I I 1 I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I 
a031 IRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ 

10 20 30 40 



70 80 
m03 1 . pep QRHSQTCGQSGRNHAQKQQCATRQ 
I I I M I I I I I I I I I I I I I I I I I I I 
a031 QRHSQTCGQSGRNHAQKQQCATRQ 
50 60 70 



Computer analysis of this amino acid sequence gave the following results; 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 03 1 shows 60.0% identity over a 85 aa overlap with a predicted ORF (ORF 03 1 .ng) 
from N, gonorrhoeae: 

m031/g031 

10 20 30 

m031 . pep RLKHGVGLHFYSAIRLFTQAVIEFPQTAEH 

I I : : I I I I I I I I I I i I I I I I I 
g031 NQQRQHHHGKRHIKQQVRIGNAHHQRHHRQRNRYGSSQAQPTDIRLFTQAVIEFPQTAEH 
60 70 80 90 100 110 

40 50 60 70 80 

m031.pep CRRTRDQHQERRNRQGFRRPVQHVGRRNQQQRHS-QTCGQSGRNHAQKQQCATRQ 
I : I I I M I I I I i M I I I I I I I I I : I Mil : I : hi : : : : : : I : | : 
g031 CQRTRDQHQERRNRQGFRRPVQHAGGRNQQTEHDEQSCLRQPSQTVHHTQNVFRRTVALV 
120 130 140 150 160 170 



g031 



TDNDAGKVNRQKAAAAYGIGKRKHKQPARHNHQRVQTFRTHLQFPINVIAVSRVKMFFTS 
180 190 200 210 220 230 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 119>: 

g032 . seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTA TTGCGCCGCC CACGATTCGA 

51 GGCGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAAGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGACGC TGCTTGCGCC 

201 CTTTGCCGGT AACGTGTACC CACGCTTCGT CCAMTATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGCTC 

301 GAACAGCGCG TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 T^TCCAACAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

401 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGCGCATCAG 

451 CCCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCACGCC GACAGCTTGC 

501 GCGCCAGCGT CCGACCGTCC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GccgTAAAAT CGCGCCGCGA CAAGTCCTGC GGCACGCcgc ctgcaTCTTC 

601 AGACGGCATT TGTGCCAACA GTGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCAAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

701 TAAGCGTCAA AATacgccgC AAACccgTCC AAAACCATAA CCGTCCCACA 

751 CAAATATCAA AAAACCAGTG A 

This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 032.ng>: 

g032.pep 

1 MRRNVPAVAV LRRPRFEAFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NLTLLAPFAG NVYP RFVQIY IICIOAVYLA HA QTAAVHQL 
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101 EQRWAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGAHQ 
151 PAFDQPGAIL PPRRQLARQR PTVQTALRQP PQRRRKIAPR QVLRHAACIF 
201 RRHLCQQCKQ FFQIAPVCRN RVLRLALAHD VFQISVKIRR KPVQNHNRPT 
251 QISKNQ* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 11 1>: 

;TGCmTGC itiGTCGCCGTT kTGCGCCGCC CATTGCGCCA 



m032 . seq 


(partial) 


1 


ATGCGGCGAA 


51 


AACGTTTTTG 


101 


AACAGGGCTT 


151 


CAGGGCTTCC 


201 


CTTTGCCGAT 


251 


TTCAAGCCGT 


301 


GAACAGGGCG 


351 


AATCCAGCAT 


401 


TCGGGCTGCT 


451 


GCCGCGCTTT 


501 


GAGCCAGCGT 



GGGTACGCGC 



This corresponds to the amino acid sequence <SEQ ID 112; ORF 032>: 

m032.pep (partial) 

1 MRRNVXAVAV XRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFADQR HLPLXAPFAD NVYPR XVQID IICIQAVYLA HA QTAAVHQF 

101 EQGWAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGVHQ 

151 AALYQPNAIL PPRRKLASQR PFPQTA. . . 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 1 13>: 

a032.seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTT TTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGCCGC TGCTTGCGTC 

201 CTTTGCCGGT AACGTGTACC CACGCCTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGCGCG TGATCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG TATGCAGCAG 

4 51 ACCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCAAGAC GACAGCTTGC 

501 GCGCCAGCGT CCGCGCATTC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GCCGTAAAAT CGCGCTGCGA CAAGCCCTGC GGCACGCCGC CTGCATCTTC 

601 AGACGGCATT TGTGCCAACA GCGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCCAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

701 TAAGCGTCAA AATGCGCCGC AAACCCGTCC AAAACCATAA CCGCCCCACA 

751 CAAATATCAA AAAAACAGTG A 

This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 032.a>: 

a032 .pep 

1 MRRNVPAVAV LRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NLPLLASFAG NVYPR LVQIY IICIQAVYLA HA QTAAVHQF 

101 EQRVIAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGMQQ 

151 TAFDQPGAIL PPRRQLARQR PRIQTALRQP PQRRRKIALR QALRHAACIF 

201 RRHLCQQRKQ FFQIAPVCRH RVLRLALAHD VFQISVKMRR KPVQNHNRPT 

251 QISKKQ* 

in032/a032 88.1% identity over a 176 aa overlap 

10 20 30 40 50 60 

in032 . pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFADQR 

lltM II IMIIII t I I I I I I I I I I I I I I I I I t I I I I I i I II 

a032 MRRNVPAVAVLRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m032 . pep HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGWAHRQRVAAVHGQIQH 
: I I I I I ) I I I I I III I I I I I I I I I I I I II I I I t II II 1:111111111111111 
a032 NLPLLASFAGNVYPRLVQIYIICIQAVYLAHAQTAAVHQFEQRVIAHRQRVAAVHGQIQH 

70 80 90 100 110 120 

130 140 150 160 170 

in032 . pep PVQPFLRQGFGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA 
llllllllllltlllllllflllll||::|:|: ll:|||||ll:|| IN Ml 
a032 PVQPFLRQGFGYALGLLRRFDVGGRVGMQQTAFDQPGAILPPRRQLARQRPRIQTALRQP 

130 140 150 160 170 180 

a032 PQRRRKIALRQALRHAACIFRRHLCQQRKQFFQIAPVCRHRVLRLALAHDVFQISVKMRR 
190 200 210 220 230 240 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 032 shows 86.4% identity over a 176 aa overlap with a predicted ORF (ORF 032.ng) 
from gonorrhoeae: 

m032/g032 



10 20 30 40 50 60 

ni032 . pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFADQR 
II M I MM III :: M M I II II II II II I II M M M I I II II II M I II II | | 
g032 MRRNVPAVAVLRRPRFEAFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m032 . pep HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGWAHRQRVAAVHGQIQH 
:l I Mil Mill III IIIIIIIIIIMIIIIIIhll I II M I I M II I I I I I I 
g032 NLTLLAPFAGNVYPRFVQIYIICIQAVYLAHAQTAAVHQLEQRWAHRQRVAAVHGQIQH 

70 80 90 100 110 120 

130 140 150 160 170 

in032 . pep PVQPFLRQGFGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA 
II I I I II I II II I M I I II M M I I t I M I I: tl:|IIIMI:|| Ml III 
g032 PVQPFLRQGFGYALGLLRRFDVGGRVGAHQPAFDQPGAILPPRRQLARQRPTVQTALRQP 

130 140 150 160 170 180 

g032 PQRRRKIAPRQVLRHAACIFRRHLCQQCKQFFQIAPVCRNRVLRLALAHDVFQISVKIRR 
190 200 210 220 230 240 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 1 15>: 

g033 .seq 

1 ATGGCGGCGG CGGACAAACT CTTGGGCGGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGAGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGTTG CCCAAATATC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCXSCAAAcgg 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGagtTTGC CCAAAAAGTC 

301 GAACAcaaaA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

351 GCTGTCGCTG TTTGAAAATT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

401 GACACAACGT CGAGAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 

451 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

501 ACTCGCCGAA AACGACCCcg tcaAATACCA CGCCGTCGCc aACCTGCcta 

551 AAGAAGGCGG GGCGCAAATg ccGTCTGAAA AAGAACCCAA GCCCGCCgCc 

601 aaaccgACCT ATACCCAAGT ATTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 
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701 GACTGGTGGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

751 ATCGCCGAGC AGCACGCCGT tacCTTTGCC GGCGGTTTGG CGTGCGAAGG 

801 CATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGTG CGGGCATCGT CGGCGCGGAC GGTCCGACCC ATGCCGGCTT 

951 GTACGATTTG AGCTTCTTGC GCTGTGTGCC GAACATGATT GTTGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCG 

1051 GATGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCC TTcatTGCCT TCGGCAGTAT GGTCGCCACC 

1201 GCATTGGCGG TTGCCGAAAA ACTGAACX5CC ACCGTCGCCG ATATGCGCTt 

1251 cgtcaaacCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCAcg 

1301 accGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCGGC 

13 51 GCGGTCTTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGAACA CGGCGATCCG AAAAAACTTT 
1451 TGGACGATTT GGGTTTGAGT GCCGAAGCGG TGGAACGCCG GGTGCGCGAG 
1501 TGGCTGCCGG ACCGTGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 033. ng>; 

gQ33 .pep 

1 MAAADKLLGG DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKEGGAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 lAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

301 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG IIRREGE KTA FIAFGSMVAT 

401 ALAVAE KLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

451 AVLEVLAKHG ICKPVLLLGV ADTVTEHGDP KKLLDDLGLS AEAVERRVRE 

501 WLPDRDAAN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

m033 .seq 

1 ATGGCGGCGG CAGACAAACT CTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCaG.CGA TATGGATGTr GATTTGCTrG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGCTG CCGAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GCCTGTTGAG TACCGTCAAA GCGCAAACGG 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

301 GAACACAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

351 GCTGTCTTTG TTTGAAAACT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

401 GACACAACGT CGAAAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 

451 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

501 ACTCGCCGAA AACGACCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 

551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 

601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 

701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

751 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 

801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTACCCGT TTTGTTTGCC 

901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTCT . 

951 GTACGATTTG AGCTTTTTGC GCTGCGTGCC GAACATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGTTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCC 

1201 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 
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1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCGGC 

1351 GCGGTGCTGG AAGTATTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 

1451 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 

1501 TGGCTGTCGG ATCX^GGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 033>: 

mOBS .pep 

1 MAAADKLLGS DRRSVAIIGD GAMTAGQAFE ALNCAXDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTIiAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTOVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 lAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

301 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG IIRREGE KTA FIAFGSMVAP 

401 ALAVAE KLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

451 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA 

501 WLSDRDAAN* 

The following partial DNA sequence was identified in K meningitidis <SEQ E) 119>: 

a033.seq 

1 ATGGCGGCGG CGGACAAACA GTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGTCA GGCGTTTGAA GCCTTGAACT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGTGCGTTG CCCAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAACGG 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

301 GAACATAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

351 ACTGTCTTTG TTTGAAAACT TCGGCTTCCG CTATACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTCGATGTAT TGGAAGACCT GCGCGGACGC 

4 51 AAAGGCCCGC AGCTTCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

501 ACTCGCCGAA AACGATCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 

551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 

601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 

701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

7 51 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 

801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTTT 

951 GTACGATTTA AGCTTTTTGC GCTGCATTCC GAATATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAATGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGTGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCT 

1201 GCATTGGCGG TCGCCGGAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 

1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCAGC 

1351 GCGGTGCTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTCTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 

1451 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 

1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 120; ORF 033.a>: 

a033.pep 

1 MAAADKQLGS DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLEDLRGR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 lAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 
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301 VDRAGIVGAD GPTHAGLYDL SFLRCIPNMI 

351 DAPAAVRYPR GTGTGVPVSD GMETVEIGKG 

4 01 ALAVAGKLNA TVADMREVKP IDEELIVRLA 

451 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP 

501 WLSDRDAAN* 

m033/a033 98.4% identity over a 509 aa overlap 

10 20 30 40 50 60 

in033 . pep MAAADKLLGSDRRSVAIIGDGAMTAGQAFEALNCAXDMDVDLLWLNDNEMSISPNVGAL 
llilll llilllllllllllllllllllllllll I I I I I I I I I I I I I I I 1 i I I I I I I I 
a033 MAAADKQLGSDRRSVAIIGDGAMTAGQAFEALNCAGDMDVDLLVVLNDNEMSISPNVGAL 
10 20 30 40 50 60 

70 80 90 100 110 120 

m033 . pep PKYLASNWRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 
lllllllllllllllllllllilllllllllllillllllllllllMIMIIIIIIill 
a033 PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m033 . pep FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 
M I I I I I t I I I I I I I I I I I I ) I I I : I I I : I I I I I I I I I I I t I I I I I I I I I I i I I I I I I I I 
a033 FENFGFRYTGPVDGHNVENLVDVLEDLRGRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

130 140 150 160 170 180 

190 200 210 220 230 240 

in033 . pep NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

MINI Illllllllll llllllllltlMMIIIIIIII Ill 

a033 NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 
190 200 210 220 230 240 

250 260 270 280 290 300 

m033 , pep RFPDRYFDVGIAEQHAVTFAGGLACEGMKPVVAIYSTFLQRAYDQLVHDIALQNLPVLFA 
I N I I I i I M I I I I I I I [ i I I I I I I t I I I I I I I I I i I I I I I I j I I I i I I It I I I I I I I j j 
a033 RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 

250 260 270 280 290 300 

310 320 330 340 350 360 

in033 . pep VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 
i I I I I I I I I I I I I I I I I I I I I I I I I : I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a033 VDRAGIVGADGPTHAGLYDLSFLRCIPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 
310 320 330 340 350 360 

370 380 390 400 410 420 

ra033.pep GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 
MtlhllllMMIIMIIIIIIIItllllfllllMlllliii M I I I I I I I I I I I I 
a033 GTGTGVPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAGKLNATVADMRFVKP 

370 380 390 400 410 420 

430 440 450 460 470 480 

ra033 . pep IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

1(11 Illill 111111111:111 Illillillltll Ill 

a033 IDEELIVRLARSHDRIVTLEENAEQGGAGSAVLEVLAECHGICKPVLLLGVADTVTGHGDP 

430 440 450 460 470 480 

490 500 510 

m033 .pep KKLLDDLGLSAEAVERRVRAWLSDRDAANX 
I I I I I I t I I I I I I I I I I II I II I I I I II II 
a033 KKLLDDLGLSAEAVERRVRAWLSDRDAANX 
490 500 510 



VAAPSDENEC RLLLSTCYQA 
IIRREGE KTA FIAFGSMVAP 
RSHDRIVTLE EKAEQGGAGS 
KKLLDDLGLS AEAVERRVRA 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 033 shows 98.4% identity over a 509 aa overlap with a predicted ORF (ORF 033.ng) 
from N. gonorrhoeae: 
in033/g033 



m033 . pep 
g033 

m033 .pep 
g033 

m033 .pep 
g033 

mO 3 3 . pep 
g033 

m033 .pep 
g033 

m033 .pep 
g033 

m033 .pep 
g033 

tn033 .pep 
g033 

m033 .pep 
g033 



MAAADKLLGSDRRSVAI IGDGAMTAGQAFEALNCAXDMDVDLLWLNDNEMS I SPNVGAL 

IIIMIIIhlllilllllllllllllllllMII lllllllillllllllllllllll 

MAAADKLLGGDRRSVAI I GDGAMTAGQAFE ALNCAGDMDVDLLWLNDNEMS I S PNVGAL 



FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

IIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 
NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

IIMhMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIll 

NLPKEGGAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 
RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 

lllllllll Illllllllllllllllll II lllllll II IIIIIIIIIIIIIIIIIIMI 

RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 



60 



60 



120 



PKYLASNWRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIillllllllllllll 

PKYIASNVVRDl^GLLSTVKAQTGKVIiDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 120 



180 



180 



240 



240 



300 



300 



360 



VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

Mlllllllllllllllllllllllllllllllllllllllllllillllllllllllll 

VDRAGIVGADGPTHAGLYDLSFLRCVPNMI VAAPSDENECRLLLSTCYQADAPAAVRYPR 360 
GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 420 

IIMIIIIMIIIIIIIIIIIIIIIIIIIIillllllll llllllllllllllllllll 

GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVATALAVAEKLNATVADMRFVKP 420 
IDEELIVRLARSHDRIVTIiEENAEQGGAGGAVLEVLAKHGICKPVLLIiGVADTVTGHGDP 480 

iiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiiiiiiNiiMiiiiiiiiiii nil 

IDEELI VRIiARSHDRI VTLEENAEQGGAGGAVLEVLAKHG I CKPVLLLGVADTVTEHGD P 480 

KKLLDDLGLSAEAVERRVRAWLSDRDAANX 510 

Illllllllllllllllll II lllllll 
KKLLDDLGLSAEAVERRVREWLPDRDAANX 510 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 121>: 

g034.seq 

1 ATGAGCCGTT TATGGTTTTT TGCCGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGACCA CGCCGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGcggGCG 

251 CGCCGTTTTT GCGCCACCTG ATTCTGGCGG CAGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTgtgCCA 

351 ACGCTCCATC CAACTGGGCT TCTCCTCCGT GATGATGGAC GGCTCTTTGC 

401 TCGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACC 

451 CGTACCGTCG TCAACTTCTC CCACGCCTGC GGCGTGTCCG TCGAAGGCGA 

501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGAAGCAGGC GAAGAAGACG 

551 GAGTGGGCGC GGCAGGCAAA CTCTCACACG ACCAAATGCT CACCAGCGTT 

601 GAAGATGCCG TGCGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCGAT 

651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG 
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701 GCGACGTATT GCGTATCGAC CGCATCAAGG AAATCCACCA AGCCCTGCCC 

751 AATACACACA TCGTGATGCA CGgctCCAGC TCCGTTCCGC AAGAatgGCT 

801 GAAAGTCATC AACGAATACG GCGGCAATAT CGGCGAAACC TACGGCGTGC 

851 CGGTTGAAGA AATCGTCGAA GGCATCAAAC ACGGCGTGCG CAAAGTCAAC 

901 ATCGATACCG ACCTGCGCCT CGCTTCCACC GGCGCGGTAC GCCGCTACCT 

951 TGCCGAAAAC CCGTCCGACT TTGATCCGCG CAAATACTTG GGCAAAACCA 

1001 TTGAAGCGAT GAAGCAAATC TGCCTCGACC GTTATCTTGC GTTCGGTTGC 

1051 GAAGGTCAGG CAGGCAAAAT CAAACCTGTT TCGTTGGAAA AAATGGCAAG 

1101 CCGTTATGCC AAGGGCGAAT TGAACCAAAT CGTCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 122; ORF 034.ng>: 



1 MSRLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHA AE NSYGLPAFNV 

51 NNLEQMRAIM EAADQVNAPV IVQASAGARK YAGAPFLRHL ILAAVEEFPH 

101 IPWMHQDHG ASPDVCQRSI QLGFSSVMMD GSLLEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAAGK LSHDQMLTSV 

201 EDAVRFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL GKTIEAMKQI CLDRYLAFGC 

351 EGQAGKIKPV SLEKMASRYA KGELNQIVK* 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 123>: 

ia034.seq (partial) 



1 ATGAGCTGTT TATGGTTTTT TGCTGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA wACAGCTACG GCyTGCCGGC GTTCAACGTC 

151 AACAACCTCG wACAGATGCG CGCCATCATG GAGGCTGCAG ACCAAGTCGA 

201 CGCCCCCGTC ATCGTACAGG CGAGTGCCGG TGCGCGCAAA TATGCGGGTG 

251 CGCCGTTTTT ACGCCACCTG ATTTTGGCGG CTGTCG7WVGT ATTTCCACAC 

301 ATCCCCGTCG TCATGCACCA AGACCACGGC GCATCACCCG ACGTGTGCCA 

351 ACGCTCCATC CAACTGGGCT TCTCCTCTGT AATGATGGAC GGCTCGCTGA 

401 TGGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACA 

451 CGTACCGTGG TTAACTTCTC CCACGCTTGC GGCGTATCCG TTGAAGGCGA 

501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGATGCAGGC GAAGAAGACG 

551 GTGTAGGCGC AGTGGGCAAA CTTTCCCACG ACCAAATGCT GACCAGCGTC 

601 GAAGATGCCG TATGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCTAT 

651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG 

701 GCGATGTATT ACGTATCGAC CGCATCAAAG AAATCCACCA AGCCCTGCCC 

751 AATACACACA TCGTGATGCA C. . . 



This corresponds to the amino acid sequence <SEQ ID 124; ORF 034>: 

in034.pep (partial) 



1 MSCLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHAA E XSYGLPAFNV 

51 NNLXQMRAIM EAADQVDAPV IVQASAGARK YAGAPFLRHL ILAAVEVFPH 

101 IPWMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGDAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVCFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMH... 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 125>: 



1 ATGAGCCGTT TATGGTTTTT TGCCGCAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGCGGGCG 

251 CGCCGTTTTT GCGCCACCTG ATTTTGGCGG CTGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTGTGCCA 

351 ACGCTCCATC CAACTGGGCT TTTCCTCCGT GATGATGGAC GGCTCGCTGA 

401 TGGAAGACGG CAAAACCCCT TCTTCTTATG AATACAACGT CAACGCCACC 



g034.pep 



a034 . seq 
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451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 



CGTACCGTGG 
AATCGGCGTA 
GTGTAGGCGC 
GAAGATGCCG 
TGCCGTCGGC 
GCGACGTGTT 
AATACACACA 
GAAAGTCATC 
CGGTTGAAGA 
ATCGATACCG 
TGCCGAAAAC 
TTGAAGCGAT 
GAAGGTCAGG 
CCGTTATGCC 



TT/VATTTCTC 
TTGGGCAACC 
AGTGGGCAAA 
TGCGTTTCGT 
ACCAGCCACG 
GCGTATCGAC 
TCGTGATGCA 
AACGAATACG 
AATCGTCGAA 
ACTTGCGCCT 
CCGTCCGACT 
GAAGCAAATC 
CAGGCAAAAT 
AAGGGCGAAT 



CCACGCCTGC 
TCGAAACTGG 
CTTTCCCACG 
TAAAGATACC 
GCGCGTACAA 
CGCATCAAAG 
CGGCTCCAGC 
GCGGCAATAT 
GGCATCAAAC 
TGCTTCCACC 
TCGATCCGCG 
TGCCTCGACC 
CAAACCGGTT 
TGAACCAAAT 



GGCGTATCCG 
CGAAGCCGGC 
ACCAAATGCT 
GGCGTTGACG 
ATTCACCCGT 
AAATCCACCA 
TCCGTTCCGC 
CGGCGAAACC 
ACGGCGTGCG 
GGCGCGGTAC 
CAAATATTTG 
GCTACCTCGC 
TCCTTGGAAA 
CGTCAAATAA 



TTGAAGGCGA 
GAAGAAGACG 
CACCAGCGTC 
CATTGGCGAT 
CCGCCCACAG 
AGCCCTGCCC 
AAGMTGGCT 
TACGGCGTGC 
TAAAGTCAAC 
GCCGCTACCT 
AGCAAAACCA 
GTTCGGTTGC 
AAATGGCAAA 



This corresponds to the amino acid sequence <SEQ ID 126; ORF 034.a>: 

a034 .pep 

1 MSRLWFFAAK NIIIRLIYLL PKETQMALVS MRQLLDHA AE NSYGLPAFNV 

51 NNLEQMRAIM EAADQVNAPV IVQASAGARK YAGAPFLRHL ILAAVEEFPH 

101 IPVVMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVRFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL SKTIEAMKQI CLDRYLAFGC 

351 EGQAGKIKPV SLEKMANRYA KGELNQIVK* 



m034/a034 96.9% identity over a 257 aa overlap 



10 20 30 40 50 60 

mO 3 4 . pep MSCLWFFAVKN 1 1 IRLI YLLPKETQMALVSMRQLLDHAAEXS YGLPAFNVNNLXQMRAIM 

II I II II : I It II II II t 1 I I I II II I I II II I I I II II I I I I I I M I I i 1 II II II 
a034 MSRLWFFAAKNIIIRLIYLLPKETQMALVSMRQLLDHAAENSYGLPAFNVNNLEQMRAIM 
10 20 30 40 50 60 



70 80 90 100 110 120 

in034 . pep EAADQVDAPVIVQASAGARKYAGAPFLRHLILAAVEVFPHIPVVMHQDHGASPDVCQRSI 
lllllhlltlllllllllMMIIIIIIIIIIIII II I I I I I I I I I I t I I I M I I II I 
a034 EAADQVNAPVIVQASAGARKYAGAPFLRHLILAAVEEFPHIPWMHQDHGASPDVCQRSI 

70 80 90 100 110 120 

130 140 150 160 170 180 

m034 . pep QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGDAG 
llllllllitllllllllMllllllllltllllllllllltlllllllilllllll:M 
a034 QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGEAG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m034 .pep EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 
I t I II I I I I I I I I I I I I I I I I I I I I I I I t I II II I t I I I I I I I i i I I I I I I I I 1 I I I I I 
a034 EEDGVGAVGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 

190 200 210 220 230 240 

250 

m034 .pep RIKEIHQALPNTHIVMH 
I I I I I I I I I I I I I II I I 

a034 RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 
250 260 270 280 290 300 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 034 shows 96.5% identity over a 257 aa overlap with a predicted ORF (ORF 034.ng) 
from N. gonorrhoeae: 
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m034/9034 

m034.pep MSCLWFFAVKNIIIRLIYLLPKETQMALVSMRQLLDHAAEXSYGLPAFOTNNLXQM^ 60 

II lllllllllllllllllllllllllllllllllllll llllllllllll llllll 
g034 MSRLWFFAVKNIIIRLIYLLPKETQMALVSMRQLLDHAAENSYGLPAFNVNNLEQMRAIM 60 

m03 4 . pep EAADQVDAPVIVQASAGARKYAGAPFLRHIiILAAVEVFPHI PWMHQDHGASPDVCQRS I 120 

lllllhlllllllllllllllllllllllllllll IIIIIIIIIMIIIIIIIIIIII 

g034 EAADQVNAPVIVQASAGARKYAGAPFLRHLILAAVEEFPHIPWMHQDHGASPDVCQRSI 120 

m034 , pep QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTVVNFSHACGVSVEGEIGVLGNLETGDAG 180 

llllllllllllhllllllllllllllllllllllllllllllllllllllllllhll 
g 0 3 4 QLGFSS VMMDGSLLEDGKTPSS YE YNVNATRTWNFSHACGVS VEGE I GVLGNLETGEAG 180 

m034 .pep EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDAIiAIAVGTSHGAYKFTRPPTGDVLRID 240 

llllllhllllllllllllllll lllllllllllllllllllllllllllllllllll 
g034 EEDGVGAAGKLSHDQMLTSVEDAVRFVKDTGVDAIiAIAVGTSHGAYKFTRPPTGDVLRID 240 

m034 .pep RIKEIHQALPNTHIVMH 257 

IIIIIIIIMIIIIIII 

g034 RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 127>: 

g036.seq 

1 ATGCTGAAGC CGTGTTTGGT ATACAGTGCC TGTGCGGCGG cgttgcCTGC 

51 GCGGACTTCG AGCAGCAGGC GTTGCGTGCC TTCGGGCAGA TGTGCGTACC 

101 AATATTCGAG CAGGGCGGAC GCAACGCCCC GTCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCTGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACAAG CGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGTAT CGAGCGCGGC CAGTGCGGCG CAGTCGGACG GTGAGGCTGG 

351 GCGGATGTTC ATGTTCGTGC CTTCCGTTCC GCCTGTTCTT TGGCAGTCAG 

401 GGCGATTTTG TTGCGGACGT AGAGCAGTTC GGCGTGTGCC GCGCCAGTTG 

451 CGGGATAGCC GCCGCCGAGG GCGAGCGCGA GAAAATCGGC GGCGGTCGGC 

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGTG CGAACGCACT 

551 GCCGATGCCG TCTGAAAAGA CGTACCCCTC GGGGAGGGCA ATGTCTGCCG 

601 CCCTACCGAC TTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CACGCATAAA ACACTTCGCC CATACGCGCG TCCGCAGCGG CGAGTATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGTG GGGATGCCGA 

751 TTAAAGGCGT GTCGAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 128; ORF 036.ng>: 

g036.pep 

1 MLKPCLVYSA CAAALPARTS SSRRCVPSGR CAYQYSSRAD ATPRRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA ILPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTVSSAASAA QSDGEAGRMF MFVPSVPPVL WQSGRFCCGR RAVRRVPRQL 

151 RDSRRRGRAR ENRRRSAYRV CLRRADGFPV RTHCRCRLKR RTPRGGQCLP 

201 PYRLDNRSNG GGSACRTTHK TLRPYARPQR RVCSFAAAAA RRRHRAWGCR 

251 LKACRTALPN LAPRRCRYAV R* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 129>: 

m036. seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCg.CGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 



wo 99/57280 




PCTAJS99/09346 



351 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG 

401 GGCGATTTTG TTGCGGACGT AGAGCAAACC GGCGTGTGCG GCATGGACGG 

451 CAGGATAACC GCCCTTGGCT GCCAATGCGA GAAAGTCGGC GGCAGTCGGC 

501 ATATCCGGTC TGCCTGAGAA CGGCGGAGCT TCTTCCAGCG C6AACGCGCT 

551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGCCCGAC CTGATAATCG CTCAAACGGT GGCAGTTCAG CGTATCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGAGCG TCCGTAGCGG CAAGGATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGAG GGTACGCCGA 

751 TTAAGGGGGT ATCAAACGGC GTTGCCAAAC CCTGAGCTAC ACCGATGCCG 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 130; ORF 036>: 

m036.pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPXSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 

201 PARPDNRSNG GSSAYRTMHK TLRPYERP*R QGCSFAAAAA RRRHRARVRR 

251 LRGYQTALPN PELHRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 131>: 

a036 . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGCAT CGAGCGCGGC GAGTGCGGCG CAATCGGCAT AAACGGCGCG 

351 GCGGATGTTC ACAGGCGCGC CCTCCGTTCC GCCTGTTCTT TGGCAGTCAA 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAGCTC GGCGTGTGCC GCAGCGACGG 

4 51 CGGGAAAACC GCCTTCAGCC GCCAGATTGA GGAAGTCGGC GGCGGTCGGC 

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGCG CGAACGCATT 

551 GCCGATGCCG TCTGAAAAGG CGCATCCTTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGACCGAC CTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGTGCG TCCGCAGCGG CAAGGATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGAG GGTACGCCGA 

751 TTAAAGGAGT ATCAAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CCGTAA 

This corresponds to the amino acid sequence <SEQ ID 132; ORF 036.a>: 

a036.pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASAA QSA*TARRMF TGAPSVPPVL WQSRRFCCGR RAARRVPQRR 

151 RENRLQPPD* GSRRRSAYRV CLRRADGFPA RTHCRCRLKR RILPAAGCLP 

201 PDRPDNRSNG GGSACRTMHK TLRPYVRPQR QGCSFAAAAA RRRHRARVRR 

251 LKEYQTALPN LAPRRCRYAV P* 

m036/a036 85.6% identity over a 270 aa overlap 

10 20 30 40 50 60 

ra036.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
I I I I I I I I I I M I i I I I I I I I I I I I I I I I ] I I i t I I I I t I M I I I I I I I I t I I I I I I I I I 
a036 MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
10 20 30 40 50 60 

70 80 90 100 110 120 

m036.pep GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
I I I I I M i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I :: I I I I I I I I I 
a036 GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASAAQSAXTARRMF 

70 80 90 100 110 120 
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130 140 150 160 170 180 

mO 3 6 . pep TGALS VRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 

III II Mini llllllil III : t::|| I 111:111 lilt I: :|) 
a036 TGAPSVPPVLWQSRRFCCGRRAARRVPQRRRENRLQPPDXGSRRRSAYRVCLRRADGFPA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m036. pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 
I I : I I t I I I I I llllllil 111111111:11 t I I I I I I II i II I I I I I I I I I II 
a036 RTHCRCRLKRRILPAAGCLPPDRPDNRSNGGGSACRTMHKTLRPYVRPQRQGCSFAAAAA 

190 200 210 220 230 240 

250 260 270 

m036 . pep RRRHRARVRRLRGYQTALPNPELHRCRYAVRX 

I I I I I I I I I I I : I I I I I I I- : M I I I I 
a036 RRRHRARVRRLKEYQTALPNLAPRRCRYAVPX 

250 260 270 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. ^gonorrhoeae 

ORF 036 shows 74.9% identity over a 271 aa overlap with a predicted ORF (ORF 036.ng) 
from A^. gonorrhoeae: 

m036/g036 

10 20 30 40 50 60 

m036.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
I I I I I I I I I I I I : I I II I I I I I I I I nil: llllllil I I I I I I I I I I I I I M I I 
g036 MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 
10 20 30 40 50 60 

70 80 90 100 110 120 

mO 3 6 . pep GRFCQT I KAAI PXS FS ARKT C S DGET S ADSNWRC VHADGLQTAS S AAS S SQS AQT ARRMF 

llltlllllli lllllltllllllllllllllllltllll:|j|ll::ll I III 
g036 GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m036 . pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: t I I I I I I t I I I II I I I I I I I : :|:| I I : I I : I t I I I I I I : : I : 

g036 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 

130 140 150 160 170 180 

190 200 210 220 230 240 

m036 . pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 
ll:| IIIIN I :: I I I I- I llllll|:|| II lllllll || I: [HMIII 
g036 RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 

190 200 210 220 230 240 

250 260 270 

ra036 . pep RRRHRARVRRLRGYQTALPNPELHRCRYAVRX 
I I I I M 11:: : II I I I : I I I I I I M 
g03 6 RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 

250 260 270 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 133>: 

m036-l . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
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351 GCGGATGTTC ACGGGCGCGC 

4 01 GGCGATTTTG TTGCGGACGT 

451 CAGGATAACC GCCCTTGGCT 

501 ATATCCGGTC TGCCTGAGAA 

551 GCCTATGCCG TCTGAAAAGG 

601 CCCGCCCGAC CTGATAATCG 

651 CATGCATAM ACACTTCGCC 



TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG 
AGAGCAAACC GGCGTGTGCG GCATGGACGG 
GCCAATGCGA GAAAGTCGGC GGCAGTCGGC 
CGGCGGAGCT TCTTCCAGCG CGAACGCGCT 
CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 
CTCAAACGGT GGCAGTTCAG CGTATCGAAC 
CATACGAGCG TCCGTAG 



This corresponds to the amino acid sequence <SEQ ID 134; ORF 0036-l>: 

m036-l .pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 
51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 
101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 
151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 
201 PARPDKRSNG GSSAYRTMHK TLRPYERP* 



iii036-i/g036 76.8% identity in 228 aa overlap 

10 20 30 40 50 60 

m036-l.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
Mill I I I 1 I I I : I I I I I I I I I I I I Mil: MIMIII I I I I I I I I I I II M I 11 
g036 MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 
10 20 30 40 50 60 



70 80 90 100 110 120 

ra036-l.pep GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
I I I I I I I II II I I M I I II II I I II II I II II M II I II I I : I I I II :: II | | | | 
g036 GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 

70 80 90 100 110 120 

130 140 150 160 170 180 

mO 3 6-1 .pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: ! I I I II I I I I I M 1 M I I I I : :|:| ll:||:|lt |||| |: :|: 

g036 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 

130 140 150 160 170 180 

190 200 210 220 229 

m036-l . pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPX 

11:1 MINI i : : lilt I I I I I I I I : I I II II I I I I I II 
g036 RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 

190 200 210 220 230 240 

g03 6 RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 
250 260 270 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 135>: 

g038 ,seq 

1 ATGACTGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTC AACGACGGCG CGTCCACGCT GCAACTGGCA 

151 AAATTCTATG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATCCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC GCGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCAATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

451 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTAAATTGT CCGCCGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATCCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GCACCTACCG CCGGCAGTAC GGCGTAGAAT AA 
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This corresponds to the amino acid sequence <SEQ ID 136; ORF 038.ng>: 

g038.pep 

1 MTDFRQDFLK FSIAQNVUCF GEFTTKAGRR SPYFFNAGLF NDGASTLQIA 

51 KFYAQSIIES GXRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYMRKE 

101 AKDRGE66VL VGAPLKGRVL IIDDVISAGT SVKESIKLIS AEGAXFAGVA 

151 lALDRMEKGT GKLSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRTYRRQY GVE* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 137>: 

m038 .seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAGGC AGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AAC6ACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGTCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGC6TGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCcGC CGGTGTCGCC 

451 ATCGCGCTCG ATCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAr AAACAATACG GkCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 138; ORF 038>: 

m038 ,pep 

1 MTDFRQDFLK FSIAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV MVPFAYNRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 lALDRMEKGT GELSAVQEVE KQ YGLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRAYRRQY GVE* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 139>; 

a038.seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTCA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACMTCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAMG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

451 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 140; ORF 038.a>: 

a038 .pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 lALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRAYRRQY GVE* 

m038/a038 100.0% identity over a 213 aa overlap 



10 20 30 40 50 60 

m038.pep MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
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I I I I I I I I I I I I 1 I I I I I M I t I I I I i I I I I I I t I I I I t I t I I I I I I I I I 1 i I I I I I I I I 
a038 MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
10 20 30 40 50 60 

70 80 90 100 110 120 

m038 . pep GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 
I I I I I I I I I I 1 I N I I I I I I I I I I I I I I I j I M M I I I I I I I I t I I I I I I I I I I I I I I I I 
a038 GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 
70 80 90 100 110 120 

130 140 150 160 170 180 

m038 . pep IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 i 1 1 

a038 IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 
130 140. 150 160 170 180 

190 200 210 

m038 . pep ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX 
I I I M I I I I I I t I I I I I I I I I I I I I t I I I I I I I I 
a038 ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX 

190 200 210 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 038 shows 98.1% identity over a 213 aa overlap with a predicted ORF (ORF 038.ng) 
from N, gonorrhoeae: 

m038/g038 

10 20 30 40 50 60 

MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
I I I I I I I I ] I I I I I I I I 1 I I i I I I I I I M t I I I I f I I I I f I I I I I I I M I I I i I I I I I I 
MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGASTLQLAKFYAQSIIES 
10 20 30 40 50 60 

70 80 90 100 110 120 

GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 
I I I I I M I I M I I I I I I I I I I I I t M I M t I I M I I I I I I I I I : I I t I I I I I I I 1 I I [ I I 
GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDRGEGGVLVGAPLKGRVL 
70 80 90 100 110 120 

130 140 150 160 170 180 

IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 
I I I I I I M I I I I I I I I I I i i I I 1 I I I I I I I I I I I I I I I I I I : t I I I I I t I I I I I I I I I I I 
IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGKLSAVQEVEKQYGLPVAPI 
130 140 150 160 170 180 

190 200 210 

ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX 
I I I I I I I t I I i I I I I I I I I I I I 1 I : I I I I I I I I I 
ASLNDLFILLQNNPEFGQFLEPVRTYRRQYGVEX 
190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 141>: 

g039 .seq 

1 ATGCCGTCCG AACCACCTGC CGCTTCAGAC GGCATCAAAC CGACACACAC 

. 51 CGAGAAAACA TCATGCCCGC CTGTTTCTGT CCGCACTGCA AAACCCGCCT 

101 CTGGGTCAAA GAAAcccagC TCAAcgtCgC ccaagGCTTC GTCGTCTgcc 

151 aaAAAtgcga agGGCTgttt aaAgccaaaG accAtctggc aaGcacGAAA 

201 gaacctatat tcaacgattg gcccgaagct gtttcgggat gTcaaaCTCG 



m038.pep 
g038 

m038 .pep 
g038 

m038 .pep 
g038 

m038 .pep 
g038 
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2 51 TCcaccgcaT cggcacgcac gccattagca aGAaacagat gtcccgcgac 

301 gaaatCgccg atatcctcaa cggcggtaca acCCTGCACG ATACGCCGCC 

351 CGCAACCGCC GCTGCCGCac CtGCCGCCGC ACCGCaggTT TCCGTACCGC 

4 01 CCGCCCGTCA GGAAGGGCTC AACTGGACTA TTGCAACCCT GTTCGCACTT 

4 51 ATCGTCCTCA TTATGCAGCT TTCCTACCTC TTCATCCTAT GA 

This corresponds to the amino acid sequence <SEQ ID 142; ORF 039.ng>: 

g03 9.pep 

1 MPSEPPAASD GIKPTHTEKT SCPPVSVRTA KPASGSKKPS STSPKASSSA 

51 KNAKGCLKPK TIWQARKNLY STIGPKLFRD VKLVHRIGTH AISKKQMSRD 

101 EIADILNGGT TLHDTPPATA AAAPAAAPQV SVPPARQEGL NWTIAT LFAL 

151 IVLIMQLSYL FIL * 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 143>: 



zn039.s6g 












1 


ATGCCGTCCG 


AACCGCCTTA 


CGCCTCAGAC 


GGCATCAAAC 


CTGACACACA 


51 


CGAGGAAATA 


CCATGCCCGC 


CTGTTTCTGC 


CCCCACTGCA 


AAACCCGTCT 


101 


CTGGGTCAAA 


GAAACCCAAC 


TCAATGTCGC 


CGnnnnnnnn 


nnnnnnnnnn 


151 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


201 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnCCC 


GAGGCTGTTT 


251 


CGGATGTCAA 


ACTCGTTCAC 


CX3TATCGGCA 


CGCGCGCCAT 


CGGCAAGAAA 


301 


CAGATTTCCC 


GTGACGAAAT 


CGCCGGCATC 


CTCAACGGCG 


GTACAACCCA 


351 


GCCCGATATT 


CCGCCCGCAA 


CCGCCGCCAC 


CCCTGCTGCC 


GCACCGCAGG 


401 


TTACCGTACC 


GCCCGCCGCG 


CCCGCCCGTC 


AGGATGGGTT 


CAACTGGACG 


451 


ATTGCAACCC 


TGTTTGCCCT 


TATCGTCCTC 


ATTATGCAGC 


TTTCCTACCT 


501 


CGTCATCCTA 


TGA 









This corresponds to the amino acid sequence <SEQ ID 144; ORF 039>: 

in039.pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXP EAVSDVKLVH RIGTRAIGKK 

101 QISRDEIAGI LiNGGTTQPDI PPATAATPAA APQVTVPPAA PARQDGFNWT 

151 lAT LFALIVL IMQLSYLVIL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 145>: 

a039.seq 

1 atgccgtctg aaccgcctta cgcctcagac ggcatcaaac ctgacacaca 

51 cgaggaaata ccatgcccgc ctgtttctgc ccccactgca aaacccgtct 

101 ctgggtca;^ gaaacccaac tcaatgtcgc ccaaggcttc gtcgtctgcc 

151 aaaaatgcga aggaatgttt aaagccaaag accatctggc aagcacgaaa 

201 gaacccatat tcaacgatt. tgcccgaagc tgtttcggat gtcaaactcg 

251 TTCACCGCAT CGGCACGAGC GCCATCGGCA AGAAACAGAT TTCCCGTGAC 

301 GAAATCGCCG GCATCCTCAA CGGCGGCACA ACCCAGCCCG ATATTCCGCC 

351 CGCAACCGCC GCCACCCCTG CTGCCGCACC GCAGGTTACC GTACCGCCCG 

401 CCGCGCCCGC CCGTCAGGAT GGGTTCAACT GGACGATTGC AACCCTGTTT 

451 GCCCTTATCG TCCTCATTAT GCAGCTTTCC TACCTCGTCA TCCTATGA 

This corresponds to the amino acid sequence <SEQ ID 146; ORF 039.a>: 

a039.pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPKASSSA 

51 KNAKECLKPK TIWQARKNPY STIXPEAVSD VKLVHRIGTS AIGKKQISRD 

101 EIAGILNGGT TQPDIPPATA ATPAAAPQVT VPPAAPARQD GFNWTIATLF 

151 ALIVLIMQLS YLVI L* 

m039/a039 79.4% identity over a 1 70 aa overlap 

10 20 30 40 50 60 

m039 . pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I i I I I ) I I I I t I 
a039 MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPKASSSAKNAKECLKPK 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m03 9 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI 
: : I I I M I I j I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I 

a039 TIWQARJCNPYSTIX PEAVSDVKLVHRIGTSAIGKKQISRDEIAGILNGGTTQPDI 

70 80 90 100 110 

130 140 150 160 170 

m039 . pep PPATAATPAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 
I I I I I I I I I I I I I I I I I I i I I I M I I I I I I I I I I I I I I I I I I I I I t I M I I 
a039 PPATAATPAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 
120 130 140 150 160 

Computer analysis of this amino acid sequience gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 039 shows 60.8% identity over a 171 aa overlap with a predicted ORF (ORF 039.ng) 
from A^. gonorrhoeae: 

3n039/g039 

10 20 30 40 50 60 

m03 9 . pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

mill lllllll h lllll: lllhlllllhl II 
g03 9 MPSEPPAASDGIKPTHTEKTSCPPVSVRTAKPASGSKKPSSTSPKASSSAKNAKGCLKPK 

10 20 30 40 50 60 

70 80 90 100 110 120 

m03 9 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI 

h IIIMIIIIhlhllhllllll lllllll I 

g03 9 TIWQARKNLYSTIG PKLFRDVKLVHRIGTHAISKKQMSRDEIADILNGGTTLHDT 

70 80 90 100 110 

130 140 150 160 170 

m03 9 . pep PPATAAT-PAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 

111111= llllllhllll ihhillllllilliiiiiiiiii III 

g03 9 PPATAAAAPAAAPQVSVPPA RQEGLNWTIATLFALIVLIMQLSYLFILX 

120 130 140 150 160 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ JD 147>: 

g040 . seq 

1 ATGAACGCGC CCGACAGCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 

51 CATCCGCC7VA ATGCGCGGCA CGACACTGGT CGCCGGCATA GAcggCCGCC 

101 TGCTCGAAGG CGGCACCTTA AATAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCCGACT CGTCCTCATC CACGGCGCGT ACCACTTCCT 

201 CGAccgCCTC GCCGCCGCGC AAGgccGCAC GCCGCATTAT TGCCGgggtt 

251 tGCGCGTTAC CGACGaAACc tcGctcgGAC AGGCGCAGCA GtttGCCGGC 

301 AccgTCCGCA GCCGTTTTGA agcCGCATTG tgcggcagCG tttcaggatt 

351 cgcgCGCGCG CCTTCCGTCC CGCTCGTAtc gggcaacttc ctgacCGCCC 

401 GTCcgatggg cgtgattgac ggaACCGata tggaatacgc gggggttatc 

451 cgcaaaaccg ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGATG CCGCCGCTCG GGCATTCCTA CGGCGGCAAA ACCTTCAATC 

551 TCGATATGGT GCAGGCCGCC GCTTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGCCAGCGA AACCCGACGA CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 

751 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGGGCCGCCG ACGGCAGCCT 

801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 

851 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATC 

901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCGTCCTAT TGCACCGCAG 

951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CTCGAACACG 
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1001 ACGGCGACCT GTACGGCTGT GCCGCACTCA AAACCTTTGC CGAAGCCGAT 

1051 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGg 

1101 CtACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGCTGCCCG AAACGCGGCG 

1251 CAAAGACTAC CGCAGCAACG GACGAAACCC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 148; ORF 040.ng>: 

g040.pep 

1 MNAPDSFVAH FREAAPYIRQ MRGTTLVAGI DGRLLEGGTL NK LAADIGLL 

51 SQLGIRLVLI HGAYHFLDRL AAAQGRTPHY CRGLRVTDET SLGQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPMGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWM PPLGHSYGGK TFNLDMVQAA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAASETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS lAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GVLLHRSREY LENHISEFSI LEHDGDLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

4 01 ERGFQTASED ELPETRRKDY RSNGRNPHIL VRRLHR* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 149>: 

m040 . seq 

1 ATGAGCGCGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGTCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCTC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAgCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 

451 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCTATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGCCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGGCGGGCA AACGCGACGG CTGATTTCGT CCGCCGAACT CTTCACCCGC 

751 AACGGCATCG GCACGTCCAT TGCCAAAGAA GCCTTCGTCT CCATCCGGCA 

801 rGCGCAywgG G.CGACATCC CGCACATCGC CGCCCTCATC CGCCCGCTGG 

851 AAGAACAGGG CATCCTGCTG CACCGCAs . C GCGAATACCT CGAAAACCAC 

901 ATTTCCGAAT TTTCCATCCT CGAACACGAC GGCAACCTGT ACGGTTGCGC 

951 CGCCCTGAAA ACCTTTGCCG AAGCCGATTG CGGCGAAATC GCCTGCCTTG 

1001 CCGTCTCGCC GCag.cACAG GACGGCGGCT ACGGCGAACG CnTGCTTGCC 

1051 CACATTATCG ATAAGGCGCG CGGCATAGGC ATAAGCAGGC TGTTCGCACT 

1101 GTCCACAAAT ACCGGCGAAT GGTTTGCCGA ACGCGGCTTT CAGACGGCAT 

1151 CGGAAGACGA GTTGCCCGAA ACGCGGCGCA AAGACTACCG CAGCAACGGA 

1201 CGGAACTCGC ATATTCTGGT ACGTCGCCTG CACCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 150; ORF 040>: 

m040.pep 

1 MSAPDLFVAH FREAVPYIRQ MRGKTLVAGI DDRLLEGDTL NK LAADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFYLDMLQTA ASAAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAGGQTRR LISSAELFTR 

251 NGIGTSIAKE AFVSIRQAHX XDIPHIAALI RPLEEQGILL HRXREYLENH 

301 ISEFSILEHD GNLYGCAALK TFAEADCGEI ACLAVSPQXQ DGGYGERXLA 

351 HIIDKARGIG ISRLFALSTN TGEWFAERGF QTASEDELPE TRRKDYRSNG 

401 RNSHILVRRL HR* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 151>: 

a040 . seq 



wo 99/57280 



PCTAJS99/09346 



228 



1 


ATGATCGTGC 


CCGACCTCTT 


51 


CATCCGCCAA 


ATGCGCGGCA 


101 


TGCTCGAAGG 


TGATACCTTA 


151 


TCGCAACTGG 


GCATCAGGCT 


201 


CGACCGCCAC 


GCCGCCGCGC 


251 


TGCGCGTTAC 


CGACGAAACC 


301 


ACCGTCCGCA 


GCCGTTTTGA 


351 


CGCGCGCGCG 


CCTTCCGTCC 


401 


GTCCGATAGG 


TGTGATTGAC 


451 


CGCAAAACCG 


ACACCGCCGC 


501 


CGTCTGGCTG 


CCGCCGCTCG 


551 


TCGATATGCT 


TCAAACCGCC 


601 


AAACTCGTTT 


ACCTGACCCT 


651 


GCTCGCCGTA 


ACCCTCTCGG 


701 


CCGGCGGCGA 


AACGCGACGG 


751 


GGCGGCGTGC 


ATCGCGTCCA 


801 


GCTGCAAGAA 


CTCTTCACCC 


851 


AAGCCTTCGT 


CTCCATCCGG 


901 


GCCGCCCTCA 


TCCGCCCGCT 


951 


CCGCGAATAC 


CTCGAAAACC 


1001 


ACGGCAACCT 


GTACGGTTGC 


1051 


TGCGGCGAAA 


TCGCCTGCCT 


1101 


CTACGGCGAA 


CGCCTGCTTG 


1151 


GCATAAGCAG 


GCTGTTCGCA 


1201 


GAACGCGGCT 


TTCAGACGGC 


1251 


CAAAGACTAC 


CGCAGCAACG 


1301 


TGCACCGCTG 


A 



TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 
AAACGCTGGT CGCCGGCATA GACGACCGCC 
AACAAGTTCG CCGCCGACAT CGGGCTTTTG 
CGTCCTCATC CACGGCGCGC GCCACTTCCT 
AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 
TCGCTCGAAC AGGCGCAGCA GTTTGCCGGC 
AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 
CGCTCGTATC GGGCAACTTC CTGACCGCCC 
GGAACCGATA TGGAATACGC GGGCGTTATC 
CCTCCGTTTC CAACTCGACG CGGGCAATAT 
GACATTCCTA CAGCGGCAAG ACCTTCCATC 
GCCTCCGTCG CCGTCTCGCT TCAGGCCGAA 
TTCAGACGGC ATTTCCCGCC CCGACGGCAC 
CACAGGAAGC GCAATCGCTG GCGGAACACG 
CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 
AATCCTCAAC GGAGCCGCCG ACGGCAGCCT 
GCAACGGCAT CGGCACGTCC ATTGCCAAAG 
CAGGCGCACA GCGGCGACAT CCCGCACATT 
GGAAGAACAG GGCATCCTGC TGCACCGCAG 
ACATTTCCGA ATTTTCCATC CTCGAACACG 
GCCGCCCTGA AAACCTTTGC CGAAGCCGAT 
TGCCGTCTCG CCGCAGGCAC AGGACGGCGG 
CCCACATTAT CGATAAGGCG CGCGGCATAG 
CTGTCCACAA ATACCGGCGA ATGGTTTGCC 
ATCGGAAGAC GAGTTGCCCG AAACGCGGCG 
GACGGAACTC GCATATTCTG GTGCGTCGCC 



This corresponds to the amino acid sequence <SEQ ID 152; ORF 040.a>: 

a040.pep 

1 MIVPDLFVAH FREAAPYIRQ MRGKTLVAGI DDRLLEGDTL NKFAADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFHLDMLQTA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAV TLSAQEAQSL AEHAGGETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS lAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GILLHRSREY LENHISEFSI LEHDGNLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

4 01 ERGFQTASED ELPETRRKDY RSNGRNSHIL VRRLHR* 



m040/a040 91.5% identity in 436 aa overlap 



10 20 30 40 50 60 

m040.pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 

I :tllillllll|:|lll|j||||||]||||||illlll||:|||||||||||| 

a040 MIVPDLFVAHFREAAPYIRQMRGKTLVAGIDDRLLEGDTLNKFAADIGLLSQLGIRLVLI 

10 - 20 30 40 50 60 

70 80 90 100 110 120 

m040.pep HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 
I I I M I M I I I M I I I I I I I I I I I I I I I I I I I I I I i I I I M I I I I I 11 I I I t I I I I I I I I 
a040 HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m040.pep PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 
I I I I I I M I i I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I ] I i I I I I I t I I I I j I I I I 
a04 0 PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 

130 140 150 160 170 180 

190 200 210 220 230 240 

m04 0.pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 

IMMI : I I I I I I i I I I I I I I I I I I I I I ] I I I I IIIIIIIIIIKIIIMII 

a04 0 TFHLDMLQTAASVAVSLQAEKLVYLTLSDGISRPDGTLAVTLSAQEAQSLAEHAGGETRR 



wo 99/57280 ^ 



PCT/US99/09346 



229 



190 200 210 220 230 240 

250 260 270 

m040.pep LISSA ELFTRNGIGTSIAKEAFVSIRQAHXXDIPHI 

I I I I I I I I I I I t I t I i I I I I I I M I I I I I Mill 

a04 0 LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAEVSIRQAHSGDIPHI 
250 260 270 280 290 300 

280 290 300 310 320 330 

AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS 

I I II I II If I I I M I I I I I I I I I I I II t I I I I I II I II I I I I II I II I I II I I I II I II 
AALIRPLEEQGILLHRSREYLENHXSEFSILEHDGNLYGCAALKTF7VEADCGEIACLAVS 

310 320 330 340 350 360 

340 350 . 360 370 380 390 

PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

II I t I I I I I I I I I I I I I I I I I I I I M II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

370 380 390 400 410 420 

400 410 
in040 . pep RSNGRNSHILVRRLHRX 
I I I I I I I I I I I I I I I I i 
a040 RSNGRNSHILVRRLHRX 
430 

Computer analysis of this amino acid sequence gave the following results: 
Homology w^ith a predicted ORF from A^. gonorrhoeae 

ORF 040 shows 88.3% identity over a 436 aa overlap with a predicted ORF (ORF 040.ng) 
from A^. gonorrhoeae: 

m040/g040 



m040 . pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 60 

1:111 11111111:11111111 I I I I I t I I I I I I I I I I I I I I t I I II I I I I I I I t I 
g040 MNAPDSFVAHFREAAPYIRQMRGTTLVAGIDGRLLEGGTLNKLAADIGLLSQLGIRLVLI 60 

m040 .pep HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 120 

III Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g040 HGAYHFLDRLAAAQGRTPHYCRGLRVTDETSLGQAQQFAGTVRSRFEAALCGSVSGFARA 120 

m04 0 . pep PSVPLVSGNFLTMPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 180 

I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I 
g040 PSVPLVSGNFLTARPMGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWMPPLGHSYGGK 180 

m040 .pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 240 

II I I I : I : I I I : I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I II I I I I I I I I I ::: I II 
g040 TFNLDMVQAAASVAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAASETRR 24 0 

m040.pep LISSA ELFTRNGIGTSIAKEAFVSIRQAHXXDIPHI 27 6 

I I I I I I I I I I I I i I I I I I I I I I 1 I I I I i I Mill 

g040 LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI 300 

m040.pep AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS 336 

I II II I I I M I : I I I I IIMIIIIIIMIIIIIMMMIIIMIMMMIMIMII 
g040 AALIRPLEEQGVLLHRSREYLENHISEFSILEHDGDLYGCAALKTFAEADCGEIACLAVS 360 

m04 0.pep PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 396 

M I I I I II I I II II II I II I I 11 I I i I I I i I II I I I I II II II I II II I M I II I I I I 
g040 PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 420 

m04 0.pep RSNGRNSHILVRRLHRX 413 

I M I I I II II I I I I I I 
g040 RSNGRNPHILVRRLHRX 437 



m04 0 .pep 
a040 



mO 4 0 . pep 
a040 
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The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 153>: 

g041 . seq 

1 ATGAGTTCGC CCAAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGCCT 

51 GATTACCGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGTGCGCTGG 

101 TGTGCGAAGT ACCGCTGACC GATATGATCC GTTATCCGCT GCTGTCCGGC 

151 GGTTCAAGTT GGACGGACGA ATACGGCAAT CCGCAGAAAT ACGAAGCCTG 

201 CAAACGCCGG CTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCACTC ATTACCACCA GCCTCAGCGA CGACCGCGTC 

301 CATCCCGCCC ACGCGCTCAA ATTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC AAACTCGCCT GCGTGTTGCT GTTTTTGAAA 

451 GAATTTTTGG GATAA 

This corresponds to the amino acid sequence <SEQ ID 154; ORF 041.ng>: 

g041.pep 

1 MSSPKHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEACKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQRESAD KLACVLLFLK 

151 EFLG* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 155>: 

m041.seq 

1 ATGAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCCGC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC GAACTCGCCT GCGTCTTGCT GTTTTTGAAA 

4 51 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ID 156; ORF 04 1>: 

in041 .pep 

1 ISSPEHIGLQ GGSNGGLITA AAEVREFQ5I GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSAQSW LYSPDGGGHT GNGTQRESAD ELACVLLFLK 

151 EFLG* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 157>: 

a041 . seq 

1 ATCAGTTCGC CCG/VACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATA GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG TU^CCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

401 CGCAGCGCGA AGCCGCCGAC GAACTCGCCT GCGTGTTGCT GTTTTTG7VAA 

4 51 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ID 158; ORF 041.a>: 

a041 .pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEVCKRR LGELSPYHNL SPGIDYPPAT. TTT.qT..qnnRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQREAAD ELACVLLFLK 
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151 EFLG* 

m04i/a04i 98.7% identity over a 154 aa overlap 

10 20 30 40 50 60 

m041 . pep ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 
I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I t I I I i I I I I I I I I I I I I I I I I I I t I I 1 I I I 
a041 ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m041 . pep PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPTUIALKFYAKLRETSAQSW 
I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
a041 PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 

70 80 90 100 110 120 

130 140 150 

m041 . pep LYSPDGGGHTGNGTQRESADELACVLLFLKEFLGX 
lllllllltlllltlll:llltlllllllilllll 
a 0 4 1 LYS PDGGGHTGNGTQREAADELACVLLFLKEFLGX 

130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 041 shows 96.8% identity over a 154 aa overlap with a predicted ORF (ORF 041.ng) 
from K gonorrhoeae: 

in041/g041 

10 20 30 40 50 60 

in041 . pep ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 
: I I f : i I I I I I I I I I I I I I I I I I II I I I I i II I I I II I I I I I I I I II I I I I II I I I II II 
g041 MSSPKHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

10 20 30 40 50 60 

70 80 90 100 110 120 

ra04 1 . pep PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 

I I I M : I I I I I I i I I I I I I I I I I I II I I I I I I I I II I M I I i M II I I II M I II I III 
g041 PQKYEACKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 

70 80 90 100 110 120 

130 140 150 

mO 4 1 . pep LYS PDGGGHTGNGTQRE S ADELACVLLFLKE FLGX 

II t I II II M II I I M II I I : II I I I I I II I I I I I 
g04 1 LYSPDGGGHTGNGTQRESADKLACVLLFLKEFLGX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 159>: 

g041-l .seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAAACAA CGACAAGGCG CGCGCACTTT CAGACGGCAT TTTGAATCAA 

151 ATGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAGAATG CGGAATATCC GAAGGGCGTG TACCGCATGT 

251 GTACGGCGGC GACCTACCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGATGA GTTGCTCGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAGC AGCCCAACCG CGCGCTGCTG ACTTTGAACA 

401 AATCGGGCGG CGATACGGCG TATACGCTGG AAGTGGATTT GGAAGCAGGG 

451 GAATTGGTAG AGGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAAG TGTGGCTGGT GGAACGCGGC 



wo 99/57280 




PCTAJS99/09346 



232 



601 AAGAGTTTCG AGGAAAGCCT GCCGGCGTAC CAAATCGATA AAGGCGCGAT 

651 GATGGTAAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 

701 TGATTGAAGC GTCGGACGGT TTTTACACCA AGACGTATTT GCAGGTGTCG 

751 TCCGAAGGCG GGGCGAAACC GTTGAACCTG CCTAATGATT GCGATGTGGT 

801 CGGCTATCTG GCGGGACATC TTTTGCTGAC GCTGCGCAAG GACTGGCACC 

851 GCGCGAACCA AAGCTATCCG AGTGGCGCGT TGGTGGCGGT GAAACTGAAT 

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCAAGCCTGC 

1001 TGGAGAATGT ACAAGGCCGT CTGAAAGCGT GGCGGTTTGC CGACAGCAAA 

1051 TGGCAGGAAG CCGAGTTGCC GCACCTGCCC TCGGGCGCGT TGGAAATGAC 

1101 CGACCAAGCG TGGGGCGGCG ACGTGGTTTA TCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC TCCAGCCGCA GCAGTTTGTT TCAGACGGCA TCGAAGTGCG 

1251 GCAGTTTTGG GCGGTGTCGT CCGACGGCGA ACGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACACGCCGA CCTTAGTCTA TGCTTACGGA 

1351 GGTTTCGGCA TTCCTGAATT GCCGCATTAT CTGGGCAGCG TCGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCAAACATC CGCGGCGGCG 

1451 GAGAATTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAC 

1501 AAAAGCGTTG ATGATTTGTT GGCAGTCGTG CGTGATTTGT CCGAACGCGG 

1551 CATGAGTTCG CCCAAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGCC 

1601 TGATTACCGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGTGCGCTG 

1651 GTGTGCGAAG TACCGCTGAC CGATATGATC CGTTATCCGC TGCTGTCCGC 

1701 CGGTTCAAGT TGGACGGACG AATACGGCAA TCCGCAGAAA TACGAAGCCT 

1751 GCAAACGCCG GCTGGGCGAA TTGTCGCCGT ATCACT^TCT TTCAGACGGC 

1801 ATCGATTATC CGCCCGCACT CATTACCACC AGCCTCAGCG ACGACCGCGT 

1851 CCATCCCGCC CACGCGCTCA AATTCTACGC CAAACTGCGC GAAACCTCGC 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACCCAACGCG AATCCGCCGA CAAACTCGCC TGCGTGTTGC TGTTTTTGAA 

2001 AGAATTTTTG GGATAA 



This corresponds to the amino acid sequence <SEQ E) 160; ORF 041-l.ng>: 

g041-l .pep 



1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILNQ 

51 MQDTRQIPFC QEHRARMYHF HQNAEYPKGV YR^3CTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLNKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPAY QIDKGAMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS 

251 SEGGAKPLNL PNDCDWGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFW ASLLENVQGR LKAWRFADSK 

351 WQEAELPHLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRLQPQQFV SDGIEVRQFW AVSSDGERIP YFHVGKNAAP DTPTLVYAYG 

4 51 GFGIPELPHY LGSVGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAW RDLSERGMSS PKHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEACKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQRESADKLA CVLLFLKEFL G* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 161>: 

m041-l.seq 



1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAGAAAA CGACAAGGCG CGCGCGCTTT CAGACGGCAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAGGACG CGGAGTATCC GAAGGGCGTG TACCGCGTGT 

251 GTACCGCGGC GACGTATCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGACGA ATTGCTTGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACTGAGCA 

4 01 TU^TTGGGCAG CGATACGGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG 

451 GAGTTGGTCG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG AACGAACGCC 

551 AGTTGACCCA ATCGGGCTAT CCGCGCGAAG TATGGCTGGT GGAACGCGGC 

601 AAGAGTTTCG AGGAAAGCCT GCCTGTGTAT CAAATCGGCG AAGACGGCAT 

651 GATGGTGAAC GCGTGGCGTT ATCTCGATCC GCAGGGTTCG CCGATTGATT 

701 TGATTGAAGC GTCGGACGGT TTTTACACCA AAACCTATTT GCGGGTCTCA 
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GCCGAAGGCG AGGCGAAACC GTTT^CCTG CCCAACGATT GCGACGTGGT 
CGGCTATCTG GCGGGGCATC TTTTGCTGAC GCTGCGCAAG GACTGGAACC 
GCGCGAACCA AAGCTATCCG AGCGGCGCGC TGGTGGCGGT GAAGCTGAAT 
CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 
GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCGAGCCTGT 
TGGAGAACGT ACAAGGCCGT CTGAAAGCAT GGCGGTTTGC CGACGGCAAA 
TGGCAGGAAG TCGAATTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 
CGACCAACCT TGGGGCGGCG ACGTGGTTTA CCTTGCCGCC AGCGATTTCA 
CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 
GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 
GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 
TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 
GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 
TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 
GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 
AAAAGCGTTG ATGATTTATT GGCAGTCGTG CGCGATTTGT CCGAACGCGG 
TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 
TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGCGCGCTG 
GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 
CGGTTCAAGC TGGACAGACG AATACGGCAA TCGGCAAAAA TACGAAGTCT 
GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 
ATCGATTATC CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT 
CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCCG 
CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 
ACCCAACGCG AATCCGCCGA CGAACTCGCC TGCGTCTTGC TGTTTTTGAA 
AGAGTTTTTG GGCTAA 

This corresponds to the amino acid sequence <SEQ DD 162; ORF 041 -1>: 

m041-l.pep 

1 KKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLENDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

,101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKLGSDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW NERQLTQSGY PREVWLVERG 

201 KSFEESLPVY QIGEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLRVS 

251 AEGEAKPLNL PNDCDWGYL AGHLLLTLRK DWNRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADGK 

351 WQEVELPRLP SGALEMTDQP WGGDWYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

451 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV RDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSAQSWLYS PDGGGHTGNG 

651 TQRESADELA CVLLFLKEFL G* 

m041-l/g041-l 94.6% identity in 671 aa overlap 

10 20 30 40 50 60 

m041-l.pep MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 

MIMMIMIIIIIIIIIIIIIIIIIIIIIIilhllllllllMll 
g041-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILNQMQDTRQIPFC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 1-1 . pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 
I I t M I I I I I I I : I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I i I 
g041-l QEHRARMYHFHQNAEYPKGVYRMCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 1-1 . pep LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

M I t I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I I I I I I I 

g041-l LVEQPNRALLTLNKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 
mO 4 1-1. pep NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGSPIDLIEASDG 
= 11111: : : I I I I 1 I I I I I M I I I I i 



751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
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g041-l DERQLTESGYPREVWLVERGKSFEESLPAYQIDKGAMMVNAWRYLDPQGSPIDLIEASDG 
190 200 210 220 230 240 

250 260 270 280 290 300 

m041-l.pep FYTKTYLRVSAEGEAKPLNLPNDCDWGYLAGHLLLTLRKDWNRANQSVPSGALVAVKLN 
1111111:11:11 I I I I I I I I I I I I I I I I I I I I I I I I I I i I : I I I I I I 1 I I I I I I I I I I 
g041-l FYTKTYLQVSSEGG7UCPLNLPNDCDVVGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 

250 260 270 280 290 300 



310 320 330 340 350 360 

m04 1-1 . pep RGELGAAQLLFAPDETQALESVETTKREWASLLENVQGRLPCAWRFADGKWQEVELPRLP 
I I I I I I I I I I I I I I I I I I I i I I I I I I I I [ I I I I I I I I I I I I I I I t I I I : I I I I : I I I : I I 
g041-l RGELGAAQLLFAPDETQALESVETTKRFWASLLENVQGRLKAWRFADSKWQEAELPHLP 

310 320 ,330 340 350 360 

370 380 390 400 410 420 

m041-l.pep SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I i I I I I I I I t I I I I I I I I I I : I : I I I 
g041-l SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRLQPQQFVSDGIEVRQFW 

370 380 390 400 410 420 

430 440 450 460 470 480 

mO 4 1 - 1 . pep TTSADGERI PYFHVGKNAAPDMPTLVYAYGGFG I PELPHYLGS IGKYWLEEGNAFVLANI 
:: I : I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I i I i I I : I I I I I I I I | | | | | | | ) 
■ g041-l AVSSDGERIPYFHVGKNAAPDTPTLVYAYGGFGIPELPHYLGSVGKYWLEEGNAFVLANI 
430 440 450 460 470 480 

490 500 510 520 530 540 

m041-l .pep RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF 
I I I I I I I I 11 I I I I M I I I I I I I I I I I I I I I I I I I I I : I I I : I I I I I I I j M M 1! I I I I 
g041-l RGGGEFGPRWHQAAQGISKHKSVDDLLAWRDLSERGMSSPKHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540 



550 560 570 580 590 600 

in041-l . pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
I I I I I I I I I I I I I I I I i I I I I I I M I I I I I I I I I I I I I I I I I : I I I I I I I I I M I I I I I I 
g041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEACKRRLGELSPYHNLSDG 

550 560 570 580 590 600 

610 620 630 640 650 660 

in04 1-1 . pep IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 
I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I t I I H I I : I t 
g041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQRESADKLA 
610 620 630 640 650 660 



670 

m041-l .pep CVLLFLKEFLGX 
I I I I I I I I I 1 I I 
g 0 4 1 - 1 C VLLFLKE FLGX 

670 



m041-l/P55577 

sp|P555771 Y4NA RHISN PROBABLE PEPTIDASE Y4NA >gi 12182536 (AE000O86) y4nA [Rhizobium sp. 
NGR2341 Length"- 726 
Score = 370 bits (940), Expect = e-101 

Identities = 217/682 (31%), Positives » 331/682 (47%), Gaps = 22/682 (3%) 

Query: 2 KSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFCQ 61 

K DP + +D + + N T + ++ + L LQ T +1 

Sbjct: 42 KDASDPRAYLNEIDGDKAMTWVEAHNLSTVDKLSKDPRYSEYQADALTILQATDRIASPS 101 

Query: 62 EHRARMY-HFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 120 

R M +F QD + +G++R T +YRSG P+W+ + V + G G 
Sbjct: 102 FARDGMIDNFWQDGTHVQGLWRRTTWESYRSGNPQWRTILDVDALSKAEGKTWVFEGGDC 161 

Query: 121 lveqpnralltlsklgsdtaytlevdleagelveggfhfpagknhvswrdensvwvcpaw 180 

L N L+ LS G D E D+ GE V+ GF P GK V+W DEN+++V W 
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SbjCt: 162 LPPTSNLCLIRLSDGGKDADVVREFDIAKGEFVKEGFVLPEGKQSVTWVDENTIYVTREW 221 



Sbjct: 


162 


Query: 


181 


Sbjct: 


222 


Query: 


233 


Sbjct: 


282 


Query: 


292 


Sbjct: 


338 


Query: 


348 


Sbjct: 


398 


Query: 


408 


Sbjct: 


458 


Query: 


465 


Sbjct: 


518 


Query: 


525 


Sbjct: 


578 


Query; 


585 


Sbjct: 


637 


Query: 


645 


Sbjct: 


697 



++T SGY 



FY 



+V+RG+S ++++ +++ 



L LP 



-IGEDGMM- 
E G++ 



-VNAWRYLDPQGSPI 232 
++ +D + 



GY G 



L+ DW A + + + 



GA++A L 



G W 



A++ 



—LFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFA 347 
LF P+E Q++ TK +V S+L NV ++++ F 



L +T 



D +++ + F P TLF 



+FD+ G+ QQFW TS DG ++PYF V + 



GK WLE+G A+ LANIRGGGEFGP+WH A 



PT++YAYGGF IP P Y 



517 



++ . + DD AV +DL + ++S H+ 



G+ GGSNGGL+ 



L +SPYHN+ G+ YP 



A+V +VPL DM+ + +SAG+SW EYG+P 



TS DDRV P HA K A 



+ Y 



GH 



+E A 



identified in A^. meningitidis <SEQ ID 163>: 



The following partial DNA sequence was 

a041-l . seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAAACAA CGACAAGGCA CGCGCATTGT CTGACGGCAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAAAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAAGATG CGGAATATCC GAAAGGCGTG TACCGCGTGT 

251 GTACCGCGGC GACTTACCGT TCGGGCTATC CTGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGACGA ATTGCTCGGT GACGATGTAT ATCTAGGCGG 

351 CGTGTCGCAC CTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACTGAGCA 

401 AATCGGGCGG CGATACCGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG 

451 GAGTTGGTAG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAGG TGTGGCTGGT GGAACGCGGC 

601 AAGAGTTTCG AGGAAAGCCT GCCGGTGTAC CAAATTGCTG AAGACGGCAT 

651 GATGGTGAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 

701 TGATTGAAGC GTCTGACGGT TTTTACACCA AAACCTATTT GCAGGTCTCA 

751 GCCGAAGGCG AAGCGA7UVCC GTTAAACCTG CCCAACGATT GCGACGTAGT 

801 CGGCTATCTG GCCGGACATC TTTTGCTGAC CTTGCGTAAA GACTGGCACC 

851 GCGCGAACCA AAGCTATCCG AGTGGCGCAT TGGTAGCAGT AAAATTAAAC 

901 CGCGGCGAAT TGGGCGCGGC GCAGCTTTTG TTTGCGCCCA ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTCGTG GCGAGCCTGC 

1001 TGGAAAACGT ACAGGGTCGT CTGAAAGCGT GGCGTTTTAC TGATGGCAAA 

1051 TGGCAGGAAA CCGAGTTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAAGCG TGGGGGGGCG ACGTAGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCATTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 
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1451 GCGAGTTCGG CCCGCGCTGG 

1501 AAAAGCGTTG ATGATTTATT 

1551 TATCAGTTCG CCCGAACACA 

1601 TGATTACTGC CGCCGCCTTC 

1651 GTGTGCGAAG TGCCGCTGAC 

1*701 CGGTTCAAGC TGGACAGACG 

1751 GCAMCGCCG GTTGGGCGAA 

1801 ATCGATTATC CGCCCGCGCT 

1851 CCATCCCGCC CACGCGCTCA 

1901 CGCAATCTTG GCTCTACTCG 

1951 ACGCAGCGCG AAGCCGCCGA 

2001 AGAGTTTTTG GGCTAA 



CATCAGGCGG CGCAGGGAAT CAGCAAACAT 
GGCAGTCGTG AGCGATTTGT CCGAACGCGG 
TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 
GTGCGCGAAC CGCAAAGCAT AGGCGCGCTG 
CGACATGATC CGTTATCCGC TGCTCTCCGC 
AATACGGCAA TCCGCAAAAA TACGAAGTCT 
TTGTCGCCGT ATCACAATCT TTCAGACGGC 
CATTACCACC AGCCTGTCCG ACGATCGCGT 
AGTTCTACGC CAAACTGCGC GAAACCTCGC 
CCTGACGGCG GCGGCCATAC CGGCAACGGC 
CGAACTCGCC TGCGTGTTGC TGTTTTTGAA 



This corresponds to the amino acid sequence <SEQ ID 164; ORF 041-l.a>: 

aO 41-1. pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPVY QIAEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYIiQVS 

251 AEGEAKPLNL PNDCDWGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPNETQALE SVETTKRFW ASLLENVQGR LKAWRFTDGK 

351 WQETELPRLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

451 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV SDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQREAADELA CVLLFLKEFL G* 



a041-l/m041-l 97.9% identity in 671 aa overlap 

10 20 30 40 50 60 

a041-l.pep MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILAQLQDTRQIPFC 
I I I I I I I I 1 I It I I I t I I I I I I I I I I I I j I I I I I I : I I I I I I I M i I I I I I I I I I I I I I I 
in041-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 

10 20 30 40 50 60 



70 80 90 100 110 120 

a041-l.pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 
I I I I I I I I I I I I M i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I [ I I I I I I I I I ! I I 
m041-l QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 

130 140 150 160 170 180 

a041-l . pep LVEQPNRALLTLSKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
I M M I I N I I I I I I : I I I I I I I M I I I I I 1 I I I I I I I I I I I I I I I I t I i I I I i I I I I I 
m041-l LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 

a041-l . pep DERQLTESGYPREVWLVERGKSFEESLPVYQIAEDGMMVNAWRYLDPQGSPIDLIEASDG 

H I I I I • I M I I I I I I I I I I I I I I I M I I I I I : I I I I I I I I I I I I f I I t I I I I 

m041-l NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGSPIDLIEASDG 

190 200 210 220 230 240 

250 260 270 280 290 300 

a04 1-1 . pep FYTKTYLQVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 
IMMII:lllllllllllll||||tlllllllltlllllll:||]|||||||||MII| 
m041-l FYTKTYLRVSAEGEAKPLNLPNDCDWGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 

250 260 270 280 290 300 

310 320 330 340 350 360 

a04 1-1 . pep RGELGAAQLLFAPNETQALESVETTKRFVVASLLENVQGRLKAWRFTDGKWQETELPRLP 

I flltM:llllllll||||||||MIIIII|||||||||:tlllll: 

in041-l RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 
310 320 330 340 350 360 
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370 380 390 400 410 420 

a041-l.pep SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
I I I I M I I I I I I I I f I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I t M I I M I I I 
m041-l SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 

370 380 390 400 410 420 

430 440 450 460 470 480 

a041-l.pep TT S ADGERI PY FHVGKNAAPDMPTLVYAYGGFG I PELPHYLGS I GK YWLEEGNAFVLANI 
I I I I I i I [ I I I I It I I I I I I I I I I I I M I I I I I I I I I I t I I M I I I I I I I I I I I I I I I I I 
in041-l TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 
430 440 450 460 470 480 



490 500 510 520 530 540 

a041-l.pep RGGGEFGPRWHQAAQGISKHKSVDDLLAWSDLSERGISSPEHIGLQGGSNGGLITAAAF 
I I I I I I I I I I t I I I I I I i ) I I M I I I I I I I ) I t'l I I I I I I I I I I I I I I I I I I I I I I I I t 
m041-l RGGGEFGPRWHQAAQGISKHKSVDDLLAWRDLSERGISSPEHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540 

550 560 570 580 590 600 

a041-l.pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I 
m041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 

550 560 570 580 590 600 



610 620 630 640 650 660 

a041-l.pep IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQREAADELA 
I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I: I I I I i 
ni041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 

610 620 630 640 650 660 

670 

a041-l .pep CVLLFLKEFLGX 
I I I I I I I It I I I 
mO 4 1 - 1 C VLLFLKE FLGX 

670 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 165>: 

9042.seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAagcG TTCGTGCCGC ATACCAGCGC 

51 GTTATCCAAC ACTTCCACGG CAGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TGCGGTCGAT GATGAAAATC CAGCCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACGG GCTGCCCGTG CCCTTCGTTG CGTAAAGATT CGTCCACGGG 

201 CGGCAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GATTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCTGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG cggCTTCGCG CTTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 ATTCAATTCC GCCACGCGCG CTTCCTTACC GAAAATCCGC GACAGGGTCT 

401 CCATCTGCTT CTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAAA 

451 TCTATGgtgG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCACCCGG 

501 CCCGCCGGTA ATGACAAACT GCGGATTGTG GCGGTGCAGG GATTCGCAAT 

551 CGGGCTCAAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 

601 AAATAG 

This corresponds to the amino acid sequence <SEQ ID 166; ORF 042.ng>: 

g042 .pep 

1 MTMICLRFQA FVPHTSALSN TSTAAGPSCP MAAVRSMMKI QPGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRFWA NSASICAFNS ATRASLPKIR DR VSICFSPL VRILPLSTV K 

151 SMWAFFANC SYASAPGPPV MTNCGLWRCR DSQSGSNSVP TVAALSNAGC 

201 K* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 167>: 
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ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 
GTTATCCAmT ACTTCGACAG CCGcCGGCCy TTCyTGCCCG ATGGCGGCGG 
TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 
AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 
CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 
CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 
TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 
CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 
CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 
TCTATGGTGG TCGCGTTTTT CGCTAACTGT TCATACGCTT CCGCGCCCGG 
CCCGCCGGTA ATGACAAGCT GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 
CGGGCTCGAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 
AAATAA 

This corresponds to the amino acid sequence <SEQ ID 168; ORF 042>: 

m042 .pep 

1 MTMICLRFQA FVPRTSALSX TSTAAGXSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTV R 

151 SMWAFFANC SYASAPGPPV MTSXGLXRCR ASXSGSNSVP TVAALSNAGC 

201 K* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 169>: 

a042.seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA ATGACAAGCT GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 

551 CGGGCTCGAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 

601 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 170; ORF 042.a>: 

a042 .pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTV R 

151 SMWAFFANC SYASAPGPPV MTS*GL*RCR AS*SGSNSVP TVAALSNAGC 

201 K* 

m042/a042 99.0% identity over a 201 aa overlap 

10 20 30 40 50 60 

MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I i 
MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
10 . 20 30 40 50 60 

70 80 90 100 110 120 

RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWMSASICAFNS 

M I I t I I I I I I I I M I t I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I M I I I 

RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
70 80 90 100 110 120 

130 140 150 160 170 180 

AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSXGLXRCR 



m042 .seq 

1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 



m04 2 .pep 
a042 

m042,pep 
a042 

in042 .pep 
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I I I I I I I I I ) I I I I I I I I I I I I 1 M I I I I I I I I I I I I I I I t I 1 I I I I I I I I I I I I I I I I I 
a042 AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 
130 140 150 160 170 180 

190 200 
m04 2 . pep ASXSGSNSVPTVAALSNAGCKX 
I I I I I I I I t I t I I I I I 1 I I I I I 
a04 2 ASXSGSNSVPTVAALSNAGCKX 

190 200 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 042 shows 93.0% identity over a 201 aa overlap with a predicted ORF (ORF 042.ng) 
from N, gonorrhoeae: 

m042/g042 

10 20 30 40 50 60 

ra042.pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I I I I I I I I I i : I I I I I I I I I I I I I I I I 1 i i I M I I I I I I II I i I I I I I I I I I I I 
g042 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 2.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I II I II II I I I II I I II M II I I I II I I I I I 11 I I I I I I II I I I I 1 I I I I I I I I I I I I I 
g04 2 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 180 

in042.pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSXGLXRCR 
hlMlllll :IIMItllllllllll|:||]lllllllllllllllllll: M III 
g042 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMWAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

190 200 
m04 2 . pep ASXSGSNSVPTVAALSNAGCKX 
I I I I I I I I I I I I I M I M I I 
g042 DSQSGSNSVPTVAALSNAGCKX 

190 200 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 171>: 

m042-l . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGA7VAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GA7VAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

451 TCTATGGTGG TCGCGTTTTT CGCTAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ID 172; ORF 042-1 >: 

in042-l .pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCXQLANR DCVPKADTLL PVTDSTSPRP 
101 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTV R 
151 SMWAFFANC SYASAPGPPV MTS* 



m042-l/g042 95.4% identity in 173 aa overlap 
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10 20 30 40 50 60 

in04 2-l.pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I t I I I I I I I I : t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g04 2 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m042-l.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I I I I I I I I i I I I I I I I I I I I I i I I I I I I I I M I I I I I I I I I I I I I M I I I I M I I i I 
g042 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 

m04 2-l.pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSX 

I : I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I i : 
g042 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMWAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

g042 DSQSGSNSVPTVAALSNAGCKX 
190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 173>: 

a042-l.seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

401 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

451 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ED 174; ORF 042-l.a>: 

a042-l.pep 

1 MTMICLRFQA FVPRTSTVLSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 
101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR 
151 SMVVAFFANC SYASAPGPPV MTS* 



m042-l/a042-l 100.0% identity in 173 aa overlap 

10 20 30 40 50 60 

m042-l.pep MTM I CLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYS KETGCPCPSL 
1 I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I M I I I 1 I I I I I I I 
a042-l MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m042-l.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I i I I I I I I I I I I I I I I I I i 1 I I I I I 
a042-l RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 

m042-l.pep ATVRASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSX 

IMIIIIIIIIIIMIIIIIlilMlllltlllllltllllillllllllllll 

a042-l AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSX 
130 140 150 160 170 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 175>: 
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g043 . seq 

1 ATGGTTGTTT CAAATCAAAA 

51 TCACATCCGA AGGCAAAAAT 

101 CGTCCCGCGT ggcggtagcc 

151 GCCCGATTCG ATGAGGGcga 

201 GTCCGGCGAC GGTTTCGCCG 

251 CATTCGTTCA GCCCGACGCG 

301 GGCGAATTCG CTGTTCAAAA 

351 GGTAGCCGTT GTCGTGGCGG 



TATCTATGCC GTCGGCCCAT CAGCACTTTT 
CCGTAATGCC GCCTGAACGC TTCgttgaAC 
gcAAAAGTGC ATcGCGGCTT GGATGGTGCT 
gcGCGTGTTC CAGCCGCAGG CGGCGCAGGC 
GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 
GCGGGCGATT TCGGCGATGG TCAGCGGGCG 
TATCGGCGGC TTCGTCTATG CGCCGGCGGC 
CGGAAGGTGA AGCGTAA 



This corresponds to the amino acid sequence <SEQ ID 176; ORF 043.ng>: 

g043.pep 

1 MWSNQNIYA VGPSALFHIR RQKSVMPPER FVEPSRVAVA AKVHRGLDGA 

51 ARFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQPDA AGDFGDGQRA 

101 GEFAVQN IGG FVYAPAAVAV WAA EGEA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 177>: 

in043 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 

151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAgGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

251 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 

This corresponds to the amino acid sequence <SEQ ID 178; ORF 043>; 

ni04 3.pep 

1 MWSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 GEFVLQDVGG FVYAPTAVTV WAAEGEAQ* 

Computer analysis of the amino acid sequences gave the following results: 

Homology with a predicted ORF from A^. meningitidis menA with menB 

ORF 043 shows 89.8% identity over a 128 aa overlap with a predicted ORF (ORF043.a) from 

K gonorrhoeae: 

m043/g043 



10 20 30 40 50 60 

in043.pep MWSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
lllllllilhllMhIIIMIIMt I I I I I i I M I I I I I M IMIII llllllil 
g043 MVVSNQNIYAVGPSALFHIRRQKSVMPPERFVEPSRVAVAAKVHRGLDGAARFDEGERVF 

10 20 30 40 50 60 



"70 80 90 100 110 120 

mO 4 3 . pep QPQAAQASGDGFAGLRFE I AFQVAFVQSDAAGDFGDGQRTGEFVLQDVGG FVYAPTAVTV 

I I I I i I I I I I I I I I I I I I I I I I I I I I I 
g043 QPQAAQASGDGFAGLRFEIAFQVAFVQPDAAGDFGDGQRAGEFAVQNIGGFVYAPAAVAV 

70 80 90 100 110 120 



130 

m043.pep WAAEGEAQX 

llllllil 
g043 WAAEGEAXX 
130 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 1 79>: 

a043. seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 



wo 99/57280 



PCT/US99/09346 



242 



151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAGGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

251 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 



This corresponds to the amino acid sequence <SEQ ID 180; ORF 043.a>: 

a04 3 .pep 

1 MWSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 6EFVLQD VGG FVYAPTAVTV WAA EGEAQ* 

m043/a043 100.0% identity in 129 aa overlap 



10 20 30 40 50 60 

m043 . pep MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVA/UCVHGGLDGAAGFDEGERVF 
I I I i I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 t I I I I I I I I I I I I I I I I I I I I I I I I 
a043 MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 3 . pep Q PQ AAQ AS G DG FAG LR FE I AFQVAFVQSDAAGDFGDGQRTGEFVLQDVGG FVYAPTAVTV 

I I i I I I I I I I i I I I i I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I M I t I 1 I I I I M 
a04 3 QPQAAQASGDGFAGLRFEIAFQVAFVQSDAAGDFGDGQRTGEFVLQDVGGFVYAPTAVTV 

70 80 90 100 110 120 



130 

m043.pep VVAAEGEAQX 
I I I M I I I I 1 
a04 3 VVAAEGEAQX 
130 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 181>: 

g044 .seq 

1 ATGCTGCCCG ACCAGAGCGT CGAGTTCTTG CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGCCC TTCGATAACG GCGGTCAGCT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 CGGCTGCCGT AGCGCATTAA 



This corresponds to the amino acid sequence <SEQ ID 182; ORF 044.ng>: 

g044,pep 

1 MLPDQSVEFL PQ VWFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQP FDNGGQLHAV VGGLRFAAEK FFFAAAVAH* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 183>: 

in044. seq 

1 ATGCCGTCCG ACTAGAGCGT CGAGTTCTTT CCAGAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGAGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCATTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCAGTCAGTT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTACCGT AGCGCAyTAa 



This corresponds to the amino acid sequence <SEQ ID 184; ORF 044>: 

m044.pep 

1 MPSDXSVEFF PE VWFDGLF GGGFPAVAL P TVYPVFHAIF DVLRVGADDD 
51 GAAAFERFQS FDDGSQFHAV VGGLRFAAEK FFFVATVAH* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 185>: 

a044 . seq 

1 GTGCCGTCCG ACCAGCGCGT CGAGTTCTTT CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCGGTCAGTT 

201 CCATACGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTGCCGT AGCGCATTAA 

This corresponds to the amino acid sequence <SEQ ID 186; ORF 044.a>: 

a044 .pep 

1 VPSDQRVEFF PQ VWFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQS FDDGGQFHTV VGGLRFAAEK FFFVAAVAH* 

m044/a044 91 .0% identity over a 89 aa overlap 

10 20 30 40 50 60 

m04 4 . pep MPSDXSVEFFPEVWFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 
: I I I I I I i I : I I I I I i I I I I I I I I I I I I I I I I I M I : I i I I I I I I I I f I I I I I I I I I I 
a04 4 VPSDQRVEFFPQVWFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQS 
10 20 30 40 50 60 

70 80 90 

FDDGSQFHAWGGLRFAAEKFFFVATVAHX 
I I I I : I M : I I I I I I I I I I I I I I i I : I I I i 
FDDGGQFHTWGGLRFAAEKFFFVAAVAHX 
70 80 90 

Computer analysis of this amino acid sequence gave the following results; 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 044 shows 86.5% identity over a 89 aa overlap with a predicted ORF (ORF 044.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

MPSDXSVEFFPEVWFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 

I I lllhlHIMIIIIIIIIMIIIIIIIIIMhlllllllllllllMIIIII 

MLPDQSVEFLPQVWFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQP 
10 20 30 40 50 60 

70 80 90 

FDDGSQFHAWGGLRFAAEKFFFVATVAHX 

lh|:hlllllllllllllli|:|:|lll 
FDNGGQLHAWGGLRFAAEKFFFAAAVAHX 
70 80 90 



m04 4 .pep 
a044 



m044/g044 

m044 .pep 
g044 

m044 .pep 
g044 



The following partial DNA sequence 



g046 . seq 






1 


ATGTCGGCAA 


TGCTGCGTCC 


51 


GATGACCATC 


CGCACGCGGT 


101 


CCGGGCAGTC 


TATCAGGCCG 


151 


CTGATGGTTT 


CGGTTATGCC 


201 


TTCGAGCTTG 


GGGCTGAGGT 


251 


CGATGCGGCC 


GGGGATGTTG 


301 


ATGTTGGTTT 


CGTCGCTGCG 


351 


CgcgccggcG 


Cgttataatg 


401 


TTTGGacatc 


gaggcggctg 


451 


TCGATAAcgg 


TTACGTCGTT 



identified in N. gonorrhoeae <SEQ ID 1 87>: 

GACAAGCAGC CCGCCGCgcc gCGCCTGTAT 
CGTCTGCAAA ACGTAAAACC TGCAATGCGC 
GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 
gaATATGGAA AGGCTGCCGt TTTcGTTGTT 
ATTCGAGGTA TtcgctGGAA CGGACGCGCG 
AACAGGTCGG CGGCAACTTT GCAGGCGACG 
GGagaGCGCG AGcagcaagt cggcatcttC 
tgAAGGGGGA TGCGccgttg ccgaAAACGG 
CCTGTTTCCT GCAATGCTTT TTCGTCGATG 
GTTGGTGATG GCGGCAAGGT TTTGCGCGAC 
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501 GGTAGAACCT ACCTGCCCGT TGCCTAAAAT GAGGATTTTC ACGGTATGGG 
551 TCGCCGGGTG A 



This corresponds to the amino acid sequence <SEQ ID 188; ORF 046.ng>: 

g046.pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RYNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLVM AARFCATVEP TCPLPKMRIF TVWVAG* 



The following partial DNA sequence 



m046 . seq 






1 


ATGTCGGCAA 


TGCTGCGTCC 


51 


GATGACCATC 


CGCACGCGGT 


101 


CCGGGCAGTC 


TATCAGGCCG 


151 


CTGATGGTTT 


CGGTTATGCC 


201 


TTCGAGCTTG 


GGGCTGAGGT 


251 


CGATGCGGCC 


GGGGATGTTG 


301 


ATGTTGGTTT 


CGTCGCTGCG 


351 


CGCGCCGGCG 


CGTTCTAATG 


401 


TTTGGACATC 


GAGGCGGCTG 


451 


TCGATAACGG 


TTACGTCGTT 


501 


GGTAGAACCT 


ACCTGTCCGT 


551 


TCGCCGAGTG 


A 



identified in N. meningitidis <SEQ ID 189>; 

GACAAGCAsT CCGCr.sGC gCGcCTGTAT 
CGTCTGCAAA ACGTAAAACC TGCAATGCGC 
GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 
GAATATGGAA AGGCTGCCGT TTTCGTTGTT 
ATTCGAGGTA TTCGCTGGAA CX3GACGCGCG 
AACAGGTCGG CGGCAACTTT GCAGGCGACG 
GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 
TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 
CCTGTTTCCT GCAATGCTTT TTCGTCGATG 
GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 
TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 



This corresponds to the amino acid sequence <SEQ ID 190; ORF 046>: 

Tn046 .pep 

1 MSAMLRPTSX PXXRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVT3LLGM AARFCATVEP TCPLPKMRIF TVWVAE* 



The following partial DNA sequence was identified in A'^. meningitidis <SEQ ID 191>: 

a04 6. seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGT CCGCCGCGCC GCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

401 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

461 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 

501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 

551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 192; ORF 046.a>: 

a046.pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLGM 7UVRFCATVEP TCPLPKMRIF TVWVAE* 



m046/a046 98.4% identity over a 186 aa overlap 

10 20 30 40 50 60 

in04 6 .pep MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
I I I I I f M I I I I I i I I I I I M I I I I I I I t I I I I I I I I t I I I I I I I I I I I I I I I I M I 
a04 6 MSAMLRPTSSPPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m04 6 . pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 
I I I I I I I I I I f I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I t i i I I I I I t I I I 
a04 6 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

70 80 90 100 110 120 

130 140 150 160 170 180' 

m04 6.pep RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 
I I I I I I I I I I I I I I M I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I t i I I I 
a04 6 RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

130 140 150 160 170 180 



m04 6.pep TVWVAEX 
I I I I I I I 

a04 6 TVWVAEX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 046 shows 97.3% identity over a 1 85 aa overlap with a predicted ORF (ORF 046.ng) 
from K gonorrhoeae: 

m046/9046 

10 20 30 40 50 60 

m04 6 . pep MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 

IIIIMIII I IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

g046 MSAMLRPTSSPPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
10 20 30 40 50 60 

70 80 90 100 110 120 

m04 6 . pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

IIIIIIIIMIIIIMMIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIII 

g046 RLPFSLFSSLGLRYSRYSLERTRAMRPGMIiNRSAATLQATMLVSSLRESASSKSASSAPA 
70 80 90 100 110 120 

130 140 150 160 170 ISO 

m04 6 . pep RSOTKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

I IIIIIIMIIIMIMIIIIIIIIMIIIIIIIIII lllllllllllllllllllll 

g046 RYNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLVMAARFCATVEPTCPLPKMRIF 
130 140 150 160 170 180 

m046.pep TVWVAEX 

mil 

g046 TVWVAGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ E> 193>: 

g047 . seq 

1 ATGGTCATCA TACAGGCGcg gcGCGGCGGG CTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT GTTTGCCGCC GCCGCCGAAA ACATCGGGGC 

201 GGTCATACCC gaATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAACATc tgctACCGCC TCGCCAAGCA GCTCGAACAC 
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301 GCATAcaacG TCAAAATCAT CGAATGCCGG CCGCGCcgtg ccgaATGGAT 

351 AGCCGAAAAC ctcgAcaaCA CCCTCGTCCT GCAAGGTTCG Gcaaccgacg 

401 aAaccctgct cgAcaacgaa tacatcgacg aaatcgaCGT ATTCTGCGCC 

451 CTGACCAACG ACGACGAAAG CAACATTAtg tCCGCCCTTT TGGCGAAAAA 

501 CCTcggcgCG AAGCgcgtca tcggCATCGT CAACCGCTCA AGCTACGTCG 

551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 

601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 

651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCGCACG 

701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 

751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

801 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGTGACCACA 

851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAGAAACTC 

901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 194; ORF 047.ng>: 

g047 .pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 lEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI CYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA . 

151 LTNDDESNIM SALIiAKNLGA KRVIGIVNRS SYVDLLEGNK ID IWSPHLI 

201 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHIIFFVSR RRILNELEKL 

301 IQVKMGFFG* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 195>: 

in047 . seq 

1 ATGGTCATCA TACAGgCGcG C . . syGCGGA STGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CCAAAGAAAC CAGCcCmgmm 

251 GcATCATGAT TkCCGGCGGC GGCAACATCG GCTACCGTCT CGCCAAGCAG 

301 CTCGAACACG CATACAACGT yAAAATCATC GAATGCCGGC CGCGCCGTGC 

3 51 CGAATGGATA GCCGAAAACC TCGACAACAC CCTCGTCyTG CAAGGTTCGG 

401 CAACCGACGA AACCCTGCTC GACAACGAAT ACATCGACGA AATCGACGTA 

451 TTCTGCGCCC TGACCAACGA CGACGAAAGC AACATTATGT CCGCCCTTTT 

501 GGCGAaAAAC CTCGGCGCGA AGCGCGTCAT CGGCATCGTC AACCGCTCAA 

551 GCTACGTCGA TTTGCTCGAA GGCAACAAAA TCGACATCGT CGTCTCCCCC 

601 CACCTCATCA CCATCGGCTC GATACTCGCC CACATCCGGC GCGGCGACAT 

651 CGTTGCCGTC CACCCCATCC GGCGCGGCAC GGCGGAAGCC ATCGAAGTCG 

701 TCGCACACGG CGACAAAT^ ACTTCCGCCA TCATCGGCAG GCGCATCAGC 

751 GGCATCAAAT GGCCCGAAGG CTGCCACATT GCCGCCGTCG TCCGCGCCGG 

801 AACCGGCGAA ACCATTATGG GACACCATAC CGAAACCGTC ATCCAAGACG 

851 GCGACCACAT CATCTTTTTC GTCTCGCGCC GGCGCATCCT GAACGAACTG 

901 GAAAAACTCA TCCAGGTCAA AATGGGCTTT TTCGGATAA 



This corresponds to the amino acid sequence <SEQ ID 196; ORF 047>: 

m047 .pep 

1 MVIIQARXXG XLVGRSIADI AQDLPDGADC QICAVYRIONR LIVPAPQTVI 

51 lEGDEILFAA AAENIGAVIP ELRPKETQRN QPXXIMIXGG GNIGYRLAKQ 

101 LEHAYNVKII ECRPRRAEWI AENLDNTLVL QGSATDETLL DNEYIDEIDV 

151 FCALTNDDES NIMSALLAKN LGAKRVIGIV NRSSYVDLLE GNKID IWSP 

201 HLITIGSILA HI RRGDIVAV HPIRRGTAEA lEWAHGDKK TSAIIGRRIS 

251 GIKWPEGCHI AAWRAGTGE TIMGHHTETV IQDGDHIIFF VSRRRILNEL 

301 EKLIQVKMGF FG* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 197>: 

a047 . seq 

1 ATGGTCATCA TACAGGCGCG GCGCGGCGGA CTGCTTGTCG GACGCAGCAT 
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51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAACATC GGCTACCGTC TCGCCAAGCA GCTCGAACAC 

301 GCATACAACG TCAAAATCAT CGAATGCCGG CCGCGCCGTG CCGAATGGAT 

351 AGCCGAAAAC CTCGACAACA CCCTCGTCCT GCAAGGTTCG GCAACCGACG 

401 AAACCCTGCT CGACAACGAA TACATCGACG AAATCGACGT ATTCTGCGCC 

4 51 CTGACCAACG ACGACGAAAG CAACATTATG TCCGCCCTTT TGGCGAAAAA 

501 CCTCGGCGCG AAGCGCGTCA TCGGCATCGT CAACCGCTCA AGCTACGTCG 

551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 

601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 

€51 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCACACG 

701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 

751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

801 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGCGACCACA 

851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAAAAACTC 

901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 



This corresponds to the amino acid sequence <SEQ ID 198; ORF 047.a>: 

. a047 .pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 lEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI GYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK ID IWSPHLI 

201 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHIIFFVSR RRILNELEKL 

301 IQVKMGFFG* 



xn047/a047 96.5% identity over a 3 12 aa overlap 



10 20 30 40 50 60 

in047 .pep MVIIQARXXGXLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA 
I I I I I I I I I I I I I I I I t i I I I I I I I I I I I I I I I I I i I t I M I I I I t I I I I I I I I I I I 
a 0 4 7 MVII QARRGGLLVGRS I ADI AQDLPDGADCQI CAV YRNNRLI VPAPQT VI I EGDE I LFAA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ni047.pep AAENIGAVIPELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 
t I I I i I I I I I I I I I I i I : : Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 
a04 7 AAENIGAVIPELRPKETSTRR IMIAGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 

70 80 90 100 110 



130 140 150 160 170 180 

m047.pep AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 

I I I I I I I I i I M I I I I I I I I I I I I I I I I I I I I 1 I I I I I ) I i I I I I I I I I I I I I I I I I I I I 
a047 AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 

120 130 140 150 160 170 



190 200 210 220 230 240 

in047 .pep NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 

I I I I I I I I I I I I I I I I M I i I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a047 NRSSYVDLLEGNKIDIWSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 

180 190 200 210 220 230 



250 260 270 280 290 300 

in047 . pep TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 

I I I M M I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I M I I I I I I I f I I I I I I 1 I I I I I I 
a047 TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 

240 250 260 270 280 290 



ra047 . pep 



310 

EKLIQVKMGFFGX 
I I I I I I I I I I I I I 
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a047 EKLIQVKMGFFGX 
300 310 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 047 shows 96.2% identity over a 312 aa overlap with a predicted ORF (ORF 047.ng) 
from N, gonorrhoeae: 

m047/g045 

m047 . pep MVIIQARXXGXLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA 60 

lllllll I lllllllllllllllllllllllllllllllllllllllllllllllll 
g047 MVIIQARRGGLLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA 60 

m047 .pep AAENIGAVIPELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 120 

IIMIIIIIIIIIIIII^ : III lllll llllllllllllllllllllllllll 
g04 7 AAENIGAVI PELRPKETSTRR- - - IMI AGGGNI CYRLAKQLEHAYNVKI I ECRPRRAEWI 117 

m047 . pep AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 180 

II II Mil II III II III Milllllllll I lllll II I II II II I IMI Mill I MM 

g047 AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 177 

m04 7.pep NRSSYVDLLEGNKIDIWSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 240 

IMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMMMIIMIMIIIIIIIII 

g047 NRSSYVDLLEGNKIDIWSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 237 

m047 . pep TSAIIGRRISGIKWPEGCHI/^WRAGTGETIMGHHTETVIQDGDHIIFFVSRRRIIjNEL 300 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM . 

g047 TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 297 

m047.pep EKLIQVKMGFFGX 313 

lllllllllllll 
g047 EKLIQVKMGFFGX 310 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 199>: 



g048 . seq 












1 


ATGCTCGACA 


AAGGCGAGGA 


GTTGCCCGTC 


GATTTCACCA 


ACCGCCTGAT 


51 


TTACTACGTc 


ggcCCcgTCG 


ATCCGGTCGG 


CGATGAAGTC 


GTCGGTCCCG 


101 


CAGGTCCGAC 


CACAGCCACC 


CGCATGGACA 


AATTTACCCG 


CCAAATGCTC 


151 


AAACAAACCG 


GCCTCTTGGG 


CATGATCGGC 


AAATCCGagc 


gcgGcgcggc 


201 


cacctGCGAA 


GCcatCGCCG 


ACAACAAGGC 


CGTGTACCTC 


ATGGCAGTCG 


251 


GCGGCGCGGC 


ATACCTCGTG 


GCAAAAGCCA 


TCAAATCTTC 


CAAAGTCTTG 


301 


GCGTTCCCCG 


AATTGGGTAT 


GGAAGCCGTT 


TACGAATTTG 


AAGTCAAAGA 


351 


TATGCCCGTA 


ACCGTCGCCG 


TGGACAGCAA 


AGGCGAATCC 


ATCCACGCCA 


401 


CCGCCCCGCG 


CAAATGGCAG 


GCGAAAATCG 


GCATCATCCC 


CGTCGAGTCT 


451 


TGA 











This corresponds to the amino acid sequence <SEQ ID 200; ORF 048 .ng>: 
g04 8 .pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 
51 KQTGLLGMIG KSERGAATCE AIADNK AVYL MAVGGAAYLV AKAI KSSKVL 
101 AFPELGMEAV YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 
151 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 201>: 

m048 . seq 

1 ATGCTCAACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 
51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCGG 
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101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGTGGC 

201 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 

351 CATGCCCGTA ACCGTCGCCG TAGATAGCAA AGGCGAATCC ATCCACGCCA 

401 CCGCCCCGCG CAAATGGCAG GCGAAAATCG GCATCATCCC CGTCGAATCT 

451 TGA 

This corresponds to the amino acid sequence <SEQ ID 202; ORF 048>: 

m04 8 .pep 

1 MLNKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 
51 EQTDLLGMIG KSERGVATCE AIADNK AVYL MAVGGAAYLV AKAI KSSKVL 
101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 
151 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 203>: 

a048 . seq 

1 ATGCTCGACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGACGAAATC GTCGGCCCAG 

101 CAGGTCCGAC CACCGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGCGGC 

201 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC GTATCTCGTG GCA7VAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 

351 CATGCCCGTA ACCGTCGCCG TAGACAGCAA AGGCGAATCC ATCCACGCCA 

401 CCGCCCCGCC CCAATGGCAG GCGAAAATCG GCATCATCCC CGTCAAATCT 

451 TGA 

This corresponds to the amino acid sequence <SEQ ID 204; ORF 048.a>: 

a04 8.pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEI VGPAGPTTAT RMDKFTRQML 
51 EQTDLLGMIG KSERGT^TCE AIADNKA VYL MAVGGAAYLV AKAI KSSKVL 
101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPPQWQ AKIGIIPVKS 
151 * 

m048/a048 96.0% identity over a 1 50 aa overlap 

10 20 30 40 50 60 

m04 8 . pep MLNKGEELPVDFTNRLIYYVGPVDPVGDEWGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

a048 MLDKGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 
10 20 30 40 50 60 

70 80 90 100 110 120 

m04 8 . pep KSERGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPV 

llllhllllllltllllllllllllltllllllllllllilliiMllliiiiiiiiii 

a04 8 KSERGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPV 
70 80 90 100 110 120 

130 140 150 

m048 .pep TVAVDSKGESIHATAPRKWQAKIGIIPVESX 
I [ I I I I I I I I I I I I I I : I I I I I I I I I I : I I 
a 0 4 8 T VAV D SKGE S I HATAPPQWQAKI G 1 1 PVKSX 

130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. ^gonorrhoeae 

ORF 048 shows 96.4% identity over a 150 aa overlap with a predicted ORF (ORF 048.ng) 
from N. gonorrhoeae: 
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m048/g048 

10 20 30 40 50 60 

m04 8 . pep MLNKGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

ihiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihii mill 

g04 8 MLDKGEELPVDFTNRLIYYVGPVDPVGDEWGPAGPTTATRMDKFTRQMLKQTGLLGMIG 
10 20 30 40 50 60 



70 80 90 100 110 120 

m04 8 . pep KSERGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPV 

llllhllllllllllllllllllllllllllllllllllllllllllhllllllllll 
g04 8 KSERGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPV 
70 80 90 100 110 120 



130 140 150 

m048 .pep TVAVDSKGESIHATAPRKWQAKIGIIPVESX 

lllllllllllllllllllllllllllllll 
g048 TVAVDSKGESIHATAPRKWQAKIGIIPVESX 
130 140 150 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 205>: 

g049 . seq 

1 ATGCGGGCGC AGGCGTTTGA TCAACCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG GATATTGATT 

101 TGGACGGGCA TCAACGCCTC TTCCGCACCX3 CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCGG ATTCTGCCGC ATCGGCGTTT TCCCCGCCCT 

201 CAATCTGTGC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC ATCGAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAAccggca tTTGCAGGGA 

301 AGCCTgcgcg TTGAGCCAGT TrrCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCgc tGTTGTGTTC TTCCTGCCAT 

401 TTCTTCAGAT ACGCCTTTAA 



This corresponds to the amino acid sequence <SEQ ID 206; ORF 049.ng>: 

g049 .pep 

1 MRAQAFDQPF GQLLFGOAEH FAPVDGFRVQ DIDLDGHQRL FRTAFAVFRN 

51 PVCRRTGFCR IGVFPALNLC GFKFGTVFFG lEPDSPPRFD VFFRNRHLQG 

101 SliRVEPVFLK DDHRVGFDFL AAIGNGAWF FLPFLQIRL* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 207>: 



m049 . seq 


(partial) 










1 


ATGCGGGCGC 


AGGCGTTTGA 


TCAGCCGTTC 


GGTCAGCTCC 


TGTTCGGACA 


51 


GGCAGAACAC 


TTCGCGCCGG 


TTGACGGCTT 


TCGGGTTCAG 


GATATTGATT 


101 


TGGACGGGCA 


TCAACGTTTC 


TTCCGCATCG 


TTTTCCCCGT 


TTTCCGAAAC 


151 


CGCCGGCTCA 


TTCGTGCCGG 


ATTCTGCCTC 


GTCGGCGTTT 


TCCCCGCTTT 


201 


CAATCTGTCC 


GGTTTCAAAT 


TCGACACTGT 


CTTTTTTGGT 


ATCAAACCGG 


251 


ATTCTCCGCC 


GCGATTCGAT 


GTGTTTTTCC 


GAAACCGACA 


TTTGCAGGGA 


301 


AGCCTGCGCG 


TTGAGCCAGT 


TTTCCTGAAG 


GACGATCATC 


GGGTCGGTTT 


351 


CGACTTCCTC 


GCCGCAATCG 


GCAACGGCGG 


CATTGTGTTC 


CTCCTGCCAT 


401 


TTTTTCAGAT 


ACGCCTT. . . 









This corresponds to the amino acid sequence <SEQ ID 208; ORF 049>: 

m049.pep (partial) 

1 MRAQAFDQPF GQLLFGOAEH FAPVDGFRVQ DIDLDGHQRF FRIVFPVFRN 

51 RRLIRAGFCL VGVFPAFNLS GFKFDTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFDFL AAIGNGGIVF LLPFFQIRL. . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 209>: 

a049.seq 
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1 ATGCGGGCGC AGGCGTTTGA TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG AATATTGATT 

101 TGGACGGGCA TCAACGCTTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCCG ATTCTGCCGC ATCGGCGTTT TCCCCGCCTT 

201 CAATCTGTCC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC ATCAAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAACCGACA TTTGCAGGGA 

301 AGCCTGCGCG TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCGG CATTGTGTTC CTCCTGCCAT 

4 01 TTTTTCAGAT ACGCCTT 



This corresponds to the amino acid sequence <SEQ ID 210; ORF 049.a>: 

a04 9.pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ NIDLDGHQRF FRTAFAVFRN 

51 PVCRRTRFCR IGVFPAFNLS GFKFGTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGGIVF LLPFF QIRL 

m049/a049 90.6% identity over a 139 aa overlap 



10 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

llllllllillllllllillllllllllll:|||||ltl]|| :| nil I: II 
a04 9 MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQNIDLDGHQRFFRTAFAVFRNPVCRRTRFCR 
10 20 30 40 50 60 

70 80 90 100 110 120 

in04 9 . pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 
: I I I I i I I I i I I i i lllllllllllllllilllllllinilllllllMIIIIIIIII 
a04 9 IGVFPAFNLSGFKFGTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 

70 80 90 100 110 120 

130 139 
m04 9 . pep AAIGNGGIVFLLPFFQIRL 
I I II I I M i II II I i II II 
a04 9 AAIGNGGIVFLLPFFQIRL 

130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 049 shows 86.3% identity over a 139 aa overlap with a predicted ORF (ORF 049.ng) 
from A^. gonorrhoeae: 

Tn049/g049 



10 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRI VFPVFRNRRLIRAGFCL 

iiiiiMMiiiiiiiiiiiiiiiiiiiiiiiiiiiiihii :i nil hill 

g049 MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRLFRTAFAVFRNPVCRRTGFCR 
10 20 30 40 50 60 



70 80 90 100 110 120 

m04 9 . pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 

Hlllhll MM lllllhlllllllllllllllllllllllllllllllllllMI 

g04 9 IGVFPALNLCGFKFGTVFFGIEPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 
70 80 90 100 110 120 



130 139 
m04 9 . pep AAIGNGGIVFLLPFFQIRL 

lllllhMIMIhllll 
g04 9 AAIGNGAWFFLPFLQIRLX 
130 140 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 21 1>: 



1 atgggcgCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGg 

51 cacgcccGAA AAAGccgtgt TGATGGcaaA AGAATCCCTG ATGAGCCACA 

101 TCGAcatCca aGaATTGCAG GAAAAAGCCG CGTccggggc ggaattgtcc 

151 accaccgaAG CcCTGCGCCT cGAACTCTTT GAAAAGGTCA ACGCGCTGGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GAC6TGAAAA 

251 TCCTCGATTA CCCGACCCAT GCCGCCTCCA AACCGATTGC CATGATTCCC 

301 AACTGTGCcg CcacCCGcca cgtcgAATTT GAATTGgACG GCTCAGGtCC 

351 TGTCGAactc acgccGCcgc gtgtCGAAGA CTGA 



This corresponds to the amino acid sequence <SEQ ID 212; ORF 050.ng>: 

g050 .pep 

1 MGAGWCPPGI LGIGIG6TPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLEtiF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVED* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 213>; 



1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGTATCG GCATCGGCGG 

51 C. .agCCgAA AAAGCCGTGC TGATGGCAAA AGAGTCCCTG ATGAGCCACA 

101 TCGACATTCA AGAATTGCAG GAAAAGGCCG CGTCCGGCGC GgAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTGGG 

201 CATCGGCGCA CAAGGCTTGG GCGGACTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA TCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA TGGCCCGATT TGA 



This corresponds to the amino acid sequence <SEQ ID 214; ORF 050>: 

mOSO.pep 

1 MGAGWCPPGI LGIGIGGXAE KAVIMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDGPI * 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 215>: 



1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGG 

51 TACGCCCGAA AAAGCCGTGT TGATGGCGAA AGAATCCCTG ATGAGCCACA 

101 TCGACATCCA AGAATTGCAG GAAAAAGCCG CGTCCGGCGC GGAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAATVAAGTCA ACGCGCTAGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA CCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA CTGGCCC 



This corresponds to the amino acid sequence <SEQ ID 216; ORF 050.a>: 

aOSO.pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDWP 

m050/a050 97.7% identity over a 129 aa overlap 



g050 . seq 



mOSO. seq 



a050. seq 



mOSO.pep 



aOSO 



10 20 30 40 50 60 

MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
I i I M I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I t I I I I I 1 M I I I f I I I I I I I i I 
MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m050 . pep EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 
I I I I I I i I I I I I I I I I I M I I I I I I I I I I I I I I I I N I I I I I I I I I I I ] I I I I I I I I I I I 
a050 EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 
70 80 90 100 110 120 

130 
TPPRVEDGPIX 
I I I I I I I I 
TPPRVEDWP 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 050 shows 98.4% identity over a 127 aa overlap with a predicted ORF (ORF OSO.ng) 
from N. gonorrhoeae: 

m050/g050 

10 20 30 40 50 60 

MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

IIIIIIIIIIIIIMIh lllllllllllllllllllllllllllllllllllllllll 

MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
10 20 30 40 50 60 

70 80 90 100 110 120 

EKVNALGIGAQGLGGLTTVLDVKIIiDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

IIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

EiCVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 
70 80 90 100 110 120 

130 
TPPRVEDGPIX 

lllllll 
TPPRVEDX 



mOSO . pep 
a050 



m050 .pep 
g050 

m050 .pep 
g050 

m050 .pep 
g050 



The following partial DNA sequence was identified mK gonorrhoeae <SEQ ID 217>: 

gOSO-l.seq 

1 ATGACCGTTA TCAAGCAAGA AGACTTTATT CAAAGTATCT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCAAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCGGA CATGAGCGTG GAAAAGATGG TTAACGAAGG CGTACGCCGC 

301 GCCTACACTT GGGAAGGCAA CACCCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCACA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCT ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACGATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGCAcgcC CGAAAAAGCC 

601 GTGTTGATGG cgaAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAG GTCAACGCGC TGGGCATCGG CGCGCAAGGC 

751 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCATGCCGCC TCCAAACCGA TTGCCATGAT TCCCAACTGT GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GTCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGACC CGATCTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGATAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCGCACAAAC GCCTCGTCAA TATGCTCGAC AAAGGCGAGG AGTTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCC GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTTACCC GCCAAATGCT CAAACAAACC GGCCTCTTGG GCATGATCGG 
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1251 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAGG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CATACCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGTA TGGAAGCCGT 

1401 TTACGAATTT GAAGTCAAAG ATATGCCCGT AACCGTCGCC GTGGACAGCA 

1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAGTC TTGA 



This corresponds to the amino acid sequence <SEQ ID 21 8; ORF 050-1. ng>: 

gOSO-l.pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG lATVFLKVGM DVQWDADMSV EKMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVED*PDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVNMLD KGEELPVDFT NRLIYYVGPV DPVGDEVVGP AGPTTATRMD 

401 KFTRQMLKQT GLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

4 51 IKSSKVLAFP ELGMEAVYEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 



g050-l/pl4407 

sp|P14407|FUMB_ECOLI FUMARATE HYDRATASE CLASS I, ANAEROBIC (FOMARASE) 
>gi 1280063 Ipirl IB44511 fumarate hydratase (EC 4.2.1.2) fumB, iron-dependent - Escherichia coli 
>gi 1 146048 (M27058} anaerobic class I fumarase {EC 4.2.1.2) [Escherichia coli] Length = 548 

Score « 172 bits (432), Expect « 4e-42 

Identities » 138/488 (28%), Positives = 216/488 (43%), Gaps =» 22/488 (4%) 



Query: 11 QSICDAFQFI SYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAENNRPICQDTG 70 

Q+ DA + H K L+ E + K Q L NS + A+ P CQDTG 

Sbjct: 53 QAFHDASFMLRPAHQKQVAAILHDPEASEND— KYVALQFLRNSEIAAKGVLPTCQDTG 109 

Query: 71 lATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGKRQNTKDNTPA 130 

A + K G V W E+ +++GV YEN + A KNTNPA 

Sbjct: 110 TAUVGKKGQRV-WTGGGD-EETLSKGVYNTYI-EDNLRYSQNAALDMYKEVNTGTNLPA 166 

Query: 131 VIHMSIVPGGKVEVTCAAKGGGSENKSKL AMLNPSDNIVDWVLKTIPTMGAGWCP 185 

I + V G + + C AKGGGS NK+ L A+L P + +++++ + T+G CP 

Sbjct: 167 QIDLYAVDGDEYKFLCVAKGGGSANKTYLYQETKALLTPG-KLKNFLVEKMRTLGTAACP 225 

Query: 186 PXXXXXXXXXTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEKVNXXX 245 

P T + L + +H EL + + L EL E+ 

Sbjct: 226 PYHIAFVIGGTSAETNLKTVKLASAHY-YDELPTEGNEHGQAFRDVQLEQELLEEAQKLG 284 

Query: 246 XXXXXXXXXTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSG PVELTPP 301 

D++++ P H AS P+ M +C+A R+++ +++ G +E P 

Sbjct: 285 LGAQFGGKYFAH-DIRVIRLPRHGASCPVGMGVSCSADRNIKAKINREGIWIEKLEHNPG 343 

uery: 302 RVEDXPDLTYSPDNGKRVDVDKLTKE EVASWKTGDVLLLNGKILTGRDAAHKRLVNM 358 

+ +VD+++ KE +++ + L L G 1+ GRD AH +L + 

Sbjct: 344 QYIPQELRQAGEGEAVKVDLNRPMKEILAQLSQYPVSTRLSLTGTIIVGRDIAHAKLKEL 403 

Query: 359 LDKGEELPVDFTNRLIYYXXXXXXXXXXXXXXXXXTTATRMDKFTRQMLKQTGLLGMIGK 418 

+D G+ELP + lYY TTA RMD + + G + M+ K 

Sbjct: 404 IDAGKELPQYIKDHPIYYAGPAKTPAGYPSGSLGPTTAGRMDSYVDLLQSHGGSMIMLAK 463 

Query: 419 SERGAATCEAIADNKAVYLMAVGG-AAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPV 477 

R +A + YL ++GG AA L ++IK + +A+PELGMEA+++ EV+D P 

Sbjct: 464 GNRSQQVTDACHKHGGFYLGSIGGPAAVLAQQSIKHLECVAYPELGMEAIWKIEVEDFPA 523 



Query: 478 TVAVDSKG 485 

+ VD KG 
Sbjct: 524 FILVDDKG 531 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 219>: 

mOSO'l . seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTATC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCTGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCGGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG 7VACGTCCAAT 

251 GGGATGCGGA CATGAGCGTG GAAGAGATGG TTAACGAAGG CGTACGCCGC 
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301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCC ATGCTCAATC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGT ATCGGCATCG GCGGCACGCC CGAAAAAGCC 

601 GTGCTGATGG CAAAAGAGTC CCTGATGAGC CACATCGACA TTCAAGAATT 

651 GCAGGAAAAG GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TGGGCATCGG CGCACAAGGC 

751 TTGGGCGGAC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTATCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCAAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT' CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCG GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGTGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGGC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGATAGCA 

1451 AAGGCG7VATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 220; ORF 050- 1>: 

m050-l .pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID 

51 ILVNSRMCAE NNRPICQDTG lATVFLKVGM 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA 

351 AHKRLVDMLN KGEELPVDFT NRLIYYVGPV 

4 01 KFTRQMLEQT DLLGMIGKSE RGVATCEAIA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA 

501 GIIPVES* 

m050-l/g050-l 98.2% identity in 507 aa overlap 

10 20 30 40 50 60 

mO 5 0-1. pep MTVIKQEDFI QSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I t M I I I I I I I I I I M I I i 
g050-l MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m050-l . pep NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
I I I I I I I I I I I I I I I I I I I I : I I I [ I I I I I I : I i I I I I I I I I 1 I I I I I I I I I I I I I I I I I . 
g050-l NNRPICQDTG I ATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGK 

70 80 90 100 110 120 

130 140 150 150 170 180 

ra050-l . pep RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 
I I I I I It I [ I I I I I I I I i I I I I I I I i I I I I I I I I 1 I I I I I I I I I i t I I I I I I I ) I I I I 1 I 
g050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m050-l . pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
M I I I I i I I I I I I I t I I I I I I I I I I I I I t I I I I I I I M i I I I I I I t I I I I I I I t i I I I I I 
g050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m050-l . pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
I t I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I 1 I I I I I I t I t I I I I I I I I I 
gO 5 0- 1 VNALG I GAQGLGGLTT VLDVKI LDYPTHAASKPI AMI PNCAATRH VEFELDGSGPVELTP 

250 260 270 280 290 300 



ALYKAWQKEE 
NVQWDADMSV 
VIHMSIVPGG 
AGWCPPGILG 
EALRLELFEK 
AATRHVEFEL 
SWKTGDVLLL 
DPVGDEWGP 
DNKAVYLMAV 
VDSKGESIHA 



NPAAKDAMTQ 
EEMVNEGVRR 
KVEVTCAAKG 
IGIGGTPEKA 
VNALGIGAQG 
DGSGPVELTP 
NGKILTGRDA 
AGPTTATRMD 
GGAAYLVAKA 
TAPRKWQAKI 
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310 320 330 340 350 360 

mO 5 0- 1 . pep PRVEDWPDLTYS PDNGKRVDVDKLTKEEVASWKTGDVLLLNGKI LTGRDAAHKRLVDMLN 
I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I : 
g050-l PRVEDXPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVNMLD 

310 320 330 340 350 360 

370 380 390 400 410 420 

mO 5 0-1. pep KGEELPVDFTNRLIYYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 
lllllllllllltllllMIIIIIIIIIIMIItllMIIIIIIIIMil IIIIIIMI 
g050-l KGEELPVDFTKRLIYYVGPVDPVGDEVVGPAGPTTATRMDKrrRQMLKQTGLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

m050-l.pep RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA 
ll:|||||||illllllllllllllltlllllilllllll)lltll:lltllilllllll 
g050-l RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPVTVA 

430 440 450- 460 470 480 

490 500 
ra050-l .pep VDSKGESIHATAPRKWQAKrGIIPVESX 
I I I i I I I I I I I M I I I I I I I I I I I I I I I 
gO 5 0 - 1 VDS KGES I H ATAPRKWQAKI G 1 1 PVESX 

490 500 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 22 1>: 

a050-l.seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AACCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGCAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGATACCGGT ATCGCGACCG TGTTTTTGAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCAGA CATGAGCGTC GAAGAGATGG TTAACGT^GG CGTGCGCCGC 

301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTTC TCGCCGACCC 

351 CGCCGGCAAA CGCCAAAATA CCAAAGACAA CACGCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGAC AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGTTCTG AAAACAAATC CAAACTCGCC ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATTCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGTACGCC CGAAAAAGCC 

601 GTGTTGATGG CGAAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAA7VA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TAGGCATCGG CGCGCAAGGC 

751 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACTACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCGAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGACGAAAT CGTCGGCCCA GCAGGTCCGA CCACCGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGGC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGACAGCA 

1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC CCCAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCAAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 222; ORF 050-l.a>: 

a050-l.pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDVID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG lATVFLKVGM DVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGD KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG I6IGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL EKSSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLD KGEELPVDFT NRLIYYVGPV DPVGDEIVGP AGPTTATRMD 

401 KFTRQMLEQT DLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPPQWQAKI 

501 GIIPVKS* 
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a050-l/ia050-l 98.4% identity in 507 aa overlap 

10 20 30 40 50 60 

a050-l . pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDAIiYKAWQKEENPAAKDAMTQILVNSRMCAE 
I I I I I I M I I I I I I I I I I I I I I t I I I 1 I I i I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I 
raO 5 0- 1 MTVI KQEDFI QS ICDAFQFI S YYHPKDYI DALYKAWQKEENPAAKDAMTQILVNSRMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

aO 50-1. pep NNRPICQDTGIATVFLKVGMDVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
I I I I I i I I I I i I I I I I I I I I : I I I I I I I I I I I I n I M I I I ) I I t I I M I I I I I I I I I I I 
m050-l NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLEU^SVLADPAGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

aO 50-1. pep RQNTKDNTPAVIHMSIVPGDKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 
I I I I I t I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I t I I I I t I I I I I I I I I I I 
m050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

a050-l.pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
I I 1 I I I I I I I I I I I I I I I I I I I I ) I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I 
m050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 ^ 260 270 280 290 300 

a050-l.pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
I I I I I I I I t I I i I I I i I I I I I I I I I I I i I I I I I I I I I I I i I I I I I I I I I I I i 1 I I t I I I I 
iriO 50 - 1 VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPI AMI PNCAATRHVEFELDGSGPVELTP 

250 260 270 280 290 300 

310 320 330 340 350 360 

a050-l . pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLD 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i : 
mOSO-l PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN 

310 320 330 340 350 360 

370 380 390 400 410 420 

a050-l . pep KGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 
lllllllllllllljllllllllllt:lilllilllllilllllllllllltll|||)|| 
m050-l KGEELPVDFTNRIIYYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

a050-l . pejp RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA 
I I : i I I I I I { I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
mOSO-l RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA 

430 440 450 460 470 480 

490 500 
a050-l.pep VDSKGES I HAT APPQWQAK I GII PVKSX 

IIIIIII1IIIM 
mO 5 0 - 1 VDSKGES I HAT APRKWQAK I GI I PVESX 

490 500 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 223>: 

g052 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 224; ORF 052.ng>: 

g052 .pep 
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1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVIiALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 225>: 

m052 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 226; ORF 052>: 

m052 .pep 

1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVGD TR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 227>: 

a052 . seq 

1 ATGGCTTTGG TCGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGAGAGCCG ACAGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCTCCC 

151 AAGGGATTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCGTTTA TATCAGTCGG CGACACGTGA CTCACTTCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAATCAC ATGGTCGCCC GCCTGCAAAA AGGTGTVAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 228; ORF 052.a>: 

a052 . pep 

1 MALVAEETEI SAPCFKG*EP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DT* LTSMPNLVTM LLIKPTWPN 

101 RLRLEITWSP ACKKVKNAA* 



m052/a052 95.8% identity over a 119 aa overlap 



10 20 30 40 50 60 

m052 . pep MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 
I I I I I I M I i I I I I I I I I I M I i I I M I I I I I 1 I I I I I I I I I t I I I I I ) t I I I I I t I I I 
a052 MALVAEETEISAPCFKGXEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m052 . pep SLVLALTAAFHSFISVGDTRLTPMPNLVTMLLIKPTWPNRLRLETTWSPACRKVKNAAX 
I I I I I I I I I I I I I I I I I I I II I I II II I I I I M II I M I II II t I I I I I : I II I t II 
a052 SLVLALTAAFHSFISVGDTXLTSMPNLVTMLLIKPTWPNRLRLEITWSPACKKVKNAAX 

70 80 90 100 110 120 



Computer analysis of this amino acid sequence gave the following results: 
Homology yyjth a predicted ORF from K gonorrhoeae 

ORF 052 shows 95.8% identity over a 119 aa overlap with a predicted ORF (ORF 052.ng) 
from AT. gonorrhoeae: 

m052/g052 



10 20 30 40 50 60 

m052 . pep MALVAEETE I SAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSP KGLDGVSKNS 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
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g 0 5 2 MALVAE ETE I S APCFKGCE PTGDSRLLS TTKSAPMP CANS AKASKS ATS PKGLDGVS KNS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m052 . pep SLVLALTAAFHSFISVGDTWLTSMPNIATOLLIKPTWPNRLRLEITWSPACKKVKN^^ 

lllllllllllllllllll II lllhlllllllllllllllll lllllhlllllll 
gO 5 2 SLVLALTAAFHS F I SVGDTRLTPMPNLVTMLLI KP1WPNRLRLETTWSPACRKVKNAAX 

70 80 90 100 110 120 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 229>: 

g073 . seq 

1 ATGTGTATGC CATACGCAAT AAGGGTTTCA GACGGCATCT GCCGCATTTT 

51 TCCGCCGATG CCGTCTGAAA CACGCAATCA GCGCGCX^AGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC 

151 AGTCCGGGGC GGatacCGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT TGGTGATGGT CGAGCGCACG TCGCCGAGGT 

251 TGGCGGTACG GGAAAAGAGT TCCACGACTT TCCACGCGGC TGCTTGGTCG 

301 GCGACTTCAA AACCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 

351 AAGCTCCGCC TGCGGATGGT CGGGCAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 230; ORF 073.ng>: 

g073 .pep 

1 MCMPYAIRVS DGICRIFPPM PSETRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 SPGRIPAASF SSGCILPCW VHGLVMV ERT SPRLAVREKS STTFHAAAWS. 

101 ATSKPMTMPP PFCCLRISSA CGWSGNPV* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 23 1>: 

m073 . seq 

1 ATGTGTATGC CATATAAGAT AAGGGTTTCA GACGGCATCT GCTGTCCAAT 

51 GCCGTCTGAA ACACGCAATC AGCGTGCGAG TGCCTGTTTC AAATCGTCAA 

101 TCAAATCGCC AACATATTCC AAACCGACCG ACAGGCGCAC CAATCCGGGG 

151 CGGATGTTGG CGGCGAGTTT TTCTTCGGGC TGCATCCTGC CGTGCGTGGT 

201 TGTCCACGGG TGGGTAATGG TCGAGCGCAC GTCACCGAGG TTGGCGGTGC 

251 GGGAAAAGAG TTCCACGCCG TCCACAACTT TCCACGCGGC TTCTTGATCG 

301 GCAACTTCAA AGCCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 

351 AAGCGCCGCC TGAGGATGGT CGGACAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 232; ORF 073>: 

m073 .pep 

1 MCMPYKIRVS DGICCPMPSE TRNQRASACF KSSIKSPTYS KPTDRRTNPG 

51 RMLAASF SSG CILPCWVHG WVMV ERTSPR LAVREKSSTP STTFHAASXS 

101 ATSKPMTMPP PFCCLRISAA XGWSDNPV* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 233>: 

a073 . seq 

1 ACGTGTATGT CATATAAGAT AAGGGTTTCA GACGGCATTT GCGGTGTTTT 

51 TCCGCCGATG CCGTCTGAA, CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC 

151 AATCCGGGGC GGATGTTGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT GGGTAATGGT CGAGCGCACG TCGCCGAGGT 

251 TGGCGGTACG GGAG7WWVGT TCGACGCCGT CCACGACTTT CCACGCGGCT 

301 GCTTGGTCGG CGACTTCAAA GCCGATGACG ATGCCGCCGC CGTTTTGCTG 

351 TTTGCGGATA AGCTCCGCCT GAGGATGGTC GGGTAATCCG GTGTAA 

This corresponds to the amino acid sequence <SEQ ID 234; ORF 073.a>: 

a073.pep 

1 TCMSYKIRVS DGICGVFPPM PSEXRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 NPGRMLAASF SSGCILPCW VHGWVMV ERT SPRLAVREKS STPSTTFHAA 

101 AWSATSKPMT MPPPFCCLRI SSA*GWSGNP V* 



xa073/a073 92.3% identity over a 130 aa overlap 
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10 20 30 40 50 

m073.pep MCMPYKIRVSDGICC— PMPSETRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF 

II I I I I II 1 I II li I I I : M I I I I I I I I I I I II I I I I I I M I I I I I I I II I I I I 

a073 TCMSYKIRVSDGICGVFPPMPSEXRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF 

10 20 30 40 50 60 



60 70 80 90 100 110 

m073 . pep SSGCILPCWVHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI 
II I I I I I II I I I i II I I I I I I I I I I I I I II I I 1 I I I II II : I II II II II I I I I t I I II 
a073 SSGCILPCWVHGWVMVERTSPRLAVREKSSTPSTTFHA7VAWSATSKPMTMPPPFCCLRI 
70 80 90 100 110 120 



120 129 
ra073.pep SAAXGWSDNPVX 

I : I I I I I I I I I 
a073 SSAXGWSGNPVX 
130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 073 shows 87.0% identity over a 131 aa overlap with a predicted ORF (ORF 073 .ng) 
from K gonorrhoeae: 

m073/g073 

10 20 30 40 50 

m073 . pep MCMPYKIRVSDGICC- - -PMPSETRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF 

Mill IIIIIIII IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIhllh Mil 

g073 MCMPYAIRVSDGICRIFPPMPSETRNQRASACFKSSIKSPTYSKPTDRRTSPGRIPAASF 
10 20 30 40 50 60 



60 70 80 90 100 110 

m073 . pep SSGCILPCVWHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI 

Illllllllllll IIMIIIIIIIIIIIIII lllll: IIIIMIMIIIIIIIII 
g073 SSGCILPCWVHGLVMVERTSPRLAVREKSST---TFHAAAWSATSKPMTMPPPFCCLRI 
70 80 90 100 110 



m073 -pep 
g073 



120 129 
SAAXGWSDNPVX 

hi III MM 

SSACGWSGNPVX 
120 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 235>: 

g075.seq 

1 ATGCCGCCTT ACTTCATCAC CCTCTTAACG ATGGAAAATA CAAAAAGCGC 

SI GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTTTCGG GCAACGCTGC GTTTGCCTGT 

151 GCCGCCAAAG CCAGCGGGGC GGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTT ACGAAATTTT 

251 TAAAAAAATG TGTTTGCGGG CTTTGTGAAG GTTTTAGAGA CCGCCTGCCG 

301 GGCCTCTTAA ACTTAATCTT CTTTTTCGTA GAATCCGAAA ATTACAAATT 

351 CCCCGCCTAT CTCTTCCAAT GCCGAGCTAA AAGCGTCTTC ATAGCTGTCA 

401 TATTTACCGG CTGA 

This corresponds to the amino acid sequence <SEQ ID 236; ORF 075. ng>: 

g075.pep 

1 MPPYFITLLT MENTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNAAFAC 

51 AAKASGAAVT TASFAPYLRQ VLINFMIFSF TKFLKKCVCG LCEGFRDRLP 

101 GLLNLIFFFV ESENYKFPAY LFQCRAKSVF lAVIFTG* 
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The following partial DNA sequence was identified in K meningitidis <SEQ ID 237>: 

. m075.8eq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAATA CAAAAAGCGC 

51 GGCGAAAATG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

401 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 238; ORF 075>: 

mOTS.pep 

1 MPSYFITLLT MENTKSAAKM PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVMDGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 

Computer analysis of this amino acid sequence gave the following results; 
Homology with a predicted ORF firom N. gonorrhoeae 

ORF 075 shows 65.7% identity over a 137 aa overlap with a predicted ORF (ORF 075.ng) 
fi-om N. gonorrhoeae: 

m075/g075 

10 20 30 40 50 60 

MPSYFITLLTMENTKSAAKMPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 

II IIIIIIIIIIIMIII llllllllllllllllllllllllhllllllll lllll 

MPPYFITLLTMENTKSAAKTPTTIQPASIPSAFAASKAFFAVSGNAAFACAAKASGAAVT 
10 20 30 40 50 60 

70 80 90 100 110 

TASFAPYLRQVLINFMIFSF KKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVAD 

IIIIIIIIIIIIIIIIIIM III: : :| I h: : :| I ::: : | 

TASFAPYLRQVLINFMIFSFTKFLKKCVCGLCEGFRDRLPGLLNLIFFFVESENYKFPAY 
70 80 90 100 110 120 

120 130 
FFQTCVNRFFEWEI IGIGDX 
:|| :: | :| : | 
LFQCRAKSVFI AVI FTGX 
130 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 239>: 

a075 . seq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAAGA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCTGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

401 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 240; ORF 075. a>: 

a075.pep 

1 MPSYFITLLT MEKTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVM DGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 



m075 .pep 
g075 

m075 -pep 
g075 

m075 .pep 
g075 
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m075/a075 98.5% identity over a 136 aa overlap 

10 20 30 40 50 60 

MPSYFITLLTMENTKSAAKMPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 
I I I I I i I I I I M : I I I I I I M t I I I I I I I I I I I I I I I I M M I I I I I I I t I I I I I I I I I 
MPSYFITLLTMEKTKSAAKTPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 
10 20 30 40 50 60 

70 80 90 100 110 120 

TASFAPYLRQVLINFMIFSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT 
I I i I I I I I I M I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I 
TASFAPYLRQVLINFMIFSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT 
70 80 90 100 110 120 

130 

CVNRFFEWEIIGIGDX 
i I I i I I I I I I I I I I I I I 
CVNRFFEWEIIGIGDX 
130 

The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 241>: 

gOBO . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTAC7UVTT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGGCATTGGG CT^GTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAAGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA TACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTTGCA CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGC7VATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAA TGTCGTTTTG GACAACGGCA 

551 TCACCGTCAG GCTCGGACGG GAAAAcgaGA TGAAACGCCT CCgGCTTTTT 

601 ACcgAAGCGT GGCAGCATCT gttgcGTAAG AATAAAAATC GGTTATCCTA 

651 TGTGGATATG Aggtataagg acggatttTC agtcccccat gctCCCGACXS 

701 GTTTACCCGA AAAAGAATcc gAAGAATatt gggaacaggt ttgggacata 

751 ttacggcctg gcgtcggaaa cggttcgacg caaatttcaa tcagttatAA 

801 GGGCAGacga acaatggaac AGcagtaa 

This corresponds to the amino acid sequence <SEQ ID 242; ORF 080.ng>: 

g080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKALGSIiA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWNWL DNGITVRLGR ENEMKRLRIiP 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVPH APDGLPEKES EEYWEQVWDI 

251 LRPGVGNGST QISISYKGRR TMEQQ* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 243>: 

m080 .seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGACATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 

551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 



m075.pep 
a075 

m075 .pep 
a075 

m075.pep 
a075 
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601 ACCGAAGCGT GGCAGCATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 
651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTTCCGACG 
701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 2441; ORF 080>: 

m080 -pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKTLGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWIWL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY ASDGLPEKES EE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 080 shows 97.9% identity over a 242 aa overlap with a predicted ORF (ORF OSO.ng) 

from A^. gonorrhoeae: 
tnoso/goeo 

10 20 30 40 50 60 

MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKICT 

llllllllllllllllllllilllllllllllllllllllllllllllllMlhlllll 

MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKALGSl^ 
10 20 30 40 50 60 

70 80 90 100 110 120 

KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 
70 80 90 100 110 120 

130 140 150 160 170 180 

EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIWL 
llllllllllllllllllllllllllllllllllllllllllllllllllllllll III 
EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWNVVL 
130 140 150 160 170 180 

190 200 210 220 230 240 

DNGITVRLGRENEMKRIiRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYASDGLPEKES 
llllllllllllllllllllllllllllllllllllllllllllllli :| llllllll 
DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVPHAPDGLPEKES 
190 200 210 220 230 240 



m080 .pep 
OBO 

mOSO .pep 
080 

m080 .pep 
080 

m080 .pep 
080 



m080 .pep 
080 

250 260 270 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 245>: 

aOSO.seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTAGTTTAT 

151 TCCGATAAGA AAGCATTGGG CAGTTTGGCG TU^GAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGTTTGGAC AGACCCGGAA 

401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 



EEX 

II 

EEYWEQVWD I LRPGVGNGSTQ I S I S YKGRRTMEQQX 



wo 99/57280 



264 



PCT/US99/09346 



551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 

601 ACCGAAGCGT GGCAACATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 

651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTCCCGACG 

701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 246; ORF 080.a>: 

a080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWIWL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY APDGLPEKES EE* 

m08o/a080 99.2% identity over a 242 aa overlap 

10 20 30 40 50 60 

mOaO . pep MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKTLGSLA 
I I I I I I I I I I I I I I i I ) I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I t I I I I M : I I I I I 
a080 MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKALGSLA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m080 . pep KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 
I I I I I M I I M I I I M I I I I I I I I I I I I I t I I I I I I M t I t I I I I I I I I I I I I I I I I I I I 
a080 KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 

70 80 90 100 110 .120 

130 140 150 160 170 180 

m08 0 . pep EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWI WL 

I t t I I I I I I I t I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I 
a080 EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIWL 
130 140 150 160 170 180 

190 200 210 220 230 240 

mOBO . pep DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYASDGLPEKES 

1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 

aOBO DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYAPDGLPEKES 
190 200 210 220 230 240 



mOSO.pep EEX 
I I I 

a080 EEX 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 247>: 
gOBl .seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AACAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGATA 

101 TTCGGGAAGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGACGCG 

151 CATGATTTTG TTGGAGGCGT ATTGTCTGCG GGCGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGCGG CTTTGGGCGG CGCGTTGAAA GTCGATGACA 

251 CGCTTGCCGC GTTGCAAACG TTGGCGAAGG CGTGGCGCGA TAATGTGAAC 

301 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 

351 GATGCTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATGAT GCCGTTTCGG 

401 CGACGGCAGG CAACTTCAAC AACCACAtcg gaTTGCCGCT GACTTTATTG 

451 AAATtaaAcg aAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA 

501 TTTTGGcgaa ctggcggtTt taacgcaaaT CGCCAAACCC GATGCCGCTT 

551 TGGtcaACAA CGCCCTGCGC GCCCATGTCG GATGCGGTTt cgacggagtg 

601 GGCGATATTG CCAAAGcgaa aagcGAGATT TatgcagGct tATGTTCAGA 

651 CGGCATGGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGTT TAATTTGAAT ACGTGCACTT TCGGCGTCGA TAGCGGCGAT 

751 GTCCGCGCGG AAAATATCGT GCTGAAACCT TTGTCGTGCG AATTTGATTT 
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801 GGTGTGCGGC GACGAGCGCA CTGCCGTGGT GCTGCCTGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCCGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG CAAGGCTTCA GCAACATCAA 

951 AGGCCGTCTG AACGTCAAAG CCGGCATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAATCCC GACAGTATGA AAGCCGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG CATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAGGAC gaAGCCGCCG CCATGCACGC CGAagtcgGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA 

1201 GCGGcggaAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

1251 GTTGATTCAA GTGTTGAGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

1301 TGAAAGGTTC GCGCTTTATG CAGAtggAAG AAGTGGTCGA GGCATTGGAG 

1351 GATAAGTga 

This corresponds to the amino acid sequence <SEQ ID 248; ORF 081.ng>: 

gOSl.pep 

1 MKPLDLNFIC QALKLPMPSE NKPVSRIVTD SRDIREGDVF FALAGGRFDA 

51 HDFVGGVLSA GAAAVW5 RE DCAALGGALK VDDTLAALQT LAKAWRDNVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVSATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV lEMGMNHFGE LAVLTQIAKP DAALVNNALR AHVGCGFDGV 

201 GDIAKAKSEI YAGLCSDGMA , LIPQEDANMA VFKTATFNLN TCTFGVDSGD 

251 VRAENIVLKP LSCEFDLVCG DERTAWLPV PGRHNVHNAA AAAALALAAG 

3 01 LSLNDVAEGL QGFSNIKGRL NVKAGIKGAT LIDDTYNANP DSMKAAVDVL 

351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE 

401 AAEKFGADGL WFAAKDPLIQ VLSHDLPERA TVLVKGSRFM QMEEWEALE 

451 DK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 249>: 

mOSl . seg 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGAGCG GTTTGACGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCT GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGTGCTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

251 CGCTTGCCGC ATTGCAAACG CTGGCAAAGG CGTGGCGTGA AAATGTGAAT 

3 01 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 

351 AATGCTGGCT GCGGTATTGC GCCgCCGTTT CGGCGATGAT GCCGTGTTGG 

401 CGACX5GCAGG CAACTTCAAC AACCATATCG GATTGCCGCT GACTTTGTTG 

451 AAGTTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA 

501 TTTCGGCGAA CTGGCGGTTT TAACGCAmAT CGCCAAACCA AATGCCGCAT 

551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TACCAAGGTT TATGTTCAGA 

651 CGGCATTGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 

751 GTTCACGCGG AAAATATTGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 

801 GGTGTGCGGC GATGAGCGCG CCGCCGTGGT GCTGCCTGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCGGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 

951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGAT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAACTG GGCGAGGACG AAGCCGCCGC TATGCACGCC GAAGTCGGCG 

1151 CGTATGCCCG CGACCAAGGC ATCGAAGCGG CTTATTTTGT CGGCGACAAC 

1201 AGCGTCGAAG CGGCGGAAAA ATTTGGCGCG GACGGTTTGT GGTTCGCCGC 

1251 CAAAGACCCG TTGATTCAAG TGTTGCGCCA CGATTTGCCC GAACGCGCCA 

13 01 CCGTGTTGGT GAAAGGTTCG CGCTTTATGC AGATGGAAGA AGTGGTCGAG 

1351 GCATTGGAGG ATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 250; ORF 081>: 

m081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGERFDA 

51 HDFVEDVLAA GAAAVWS RE DCAAMEX3ALK VDDTLAALQT LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV lEMGMNHFGE LAVLTXIAKP NAALVNNAMR AHVGCGFDGV 
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201 GDIAKAKSEI 

251 VHAENIVLKP 

301 LSLNDVAEGL 

351 ARMPAPRIFV 

401 SVEAAEKFGA 

451 ALEDK* 



YQGLCSDGIA 
LSCEFDLVCG 
KGFSNIKGRL 
MGDMGELGEL 
DGLWFAAKDP 



LIPQEDANMA 
DERAAWLPV 
NVKSGIKGAT 
GEDEAAAMHA 
LIQVLRHDLP 



VFKTATIiNLN 
PGRHNVHNAA 
LIDDTYNANP 
EVGAYARDQG 
ERATVLVKGS 



TRTFGIDSGD 
AAAAIiALAAG 
DSMKAAIDVL 
lEAAYFVGDN 
RFMQMEEWE 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 081 shows 94.1% identity over a 455 aa overlap with a predicted ORF (ORF OSl.ng) 
from A^. gonorrhoeae: 

m081/g081 

10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 

IIIIIIIMIIIIIIIIIIhllllllllllllll lllllllll llllllll Ihl 
gOai MKPLDLNFICQALKLPMPSENKPVSRIVTDSRDIREGDVFFALAGGRFDAHDFVGGVLSA 
10 20 30 40 SO 60 

70 80 90 100 110 120 

mO 8 1 . pep GAAAVVVSREDCAAMDGALKVDDTLAAIiQTIAKAWRENVNPFVFGITGSGGKTTVKEMIA 

IIIIIMIIIIIIh lllllllllllllllllllhllllllllllllllllllMIII 

g081 GAAAVVVSREDCAALGGALKVDDTLAALQTIAKAWRDNVNPFVFGITGSGGKTTVKEMLA 
70 80 90 100 110 120 

130 140 150 160 170 180 

m081 . pep AVLRRRFGDDAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP 

llllllllllll llllllllllllllllllllllllllllllllllllllllll nil 

g081 AVLRRRFGDDAVSATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGEIiAVLTQIAKP 
130 140 150 160 170 180 

190 200 210 220 230 240 

m081 .pep NAALVNNAMRAHVGOSFDGVGDIAKAKSEIYQGLCSDGIALIPQEDANMAVFKTATLNLN 

Hlllllhllllllllllllllllllllil IMIIhlllllMlllllllllhlll 
g081 DAALVNNALRAHVGCGFDGVGDIAKAKSEIYAGLCSDGMAIiIPQEDANMAVFKTATFNLN 
190 200 210 220 230 240 



250 260 270 280 290 300 

m081 .pep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAWLPVPGRHNVHNAAAAAALAIiAAG 

I llhllllhlllllllMlllllllllllhllllllllllllllllllllllllll 

g081 TCTFGVDSGDVRAENIVLKPLSCEFDLVCGDERTAWLPVPGRHNVHNAAAAAALALAAG 
250 260 270 280 290 300 

310 320 330 340 350 360 

mOai.pep LSLNDVAEGLKGFSNIKGRliNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV 

lllllllllhlllllllllllhMMIIIIIIIIIIIIMIIIhMlllllllllll 

g081 LSLNDVAEGLQGFSNIKGRIiNVKAGIKGATLIDDTYNANPDSMKAAVDVLARMPAPRIFV 
310 320 330 340 350 360 

370 380 390 400 410 420 

raOa 1 . pep MGDMGELGELGEDEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 

Mlilllll IIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

g081 MGDMGELGE- - -DEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 

370 380 390 400 410 

430 440 450 

mo 8 1 . pep LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

Hill IIIIIMIIIIIIIIIIIIIIIIIIIIIII 

g081 LIQVLSHDLPERATVLVKGSRFMQMEEWEALEDKX 
420 430 440 450 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 251>: 

a081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGATGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCG GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGTTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

251 CGCTTACCGC GTTGCAAATG TTGGCGAAGG CGTGGCGCGA GAATGTGAAC 

301 CCGTTTGTGT TCGGTATTAC CGGCTCGGGC GGCAAGACGA CGGTGAAGGA 

351 AATGTTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATAAT GCCGTTTTGG 

4 01 CGACGGCAGG CAACTTCAAC AACCACATCG GATTGCCGTT GACTTTGTTG 

4 51 AAATTAAACG /W\AACACCG CTATGCCGTG ATTGAAATGG GTATGAACCA 

501 TTTTGGCGAA CTGGCGGTTT TGACACAAAT CGCCAAACCC GATGCCGCAT 

551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TATC7VAGGCT TATGTTCAGA 

651 CGGCATGGCG CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 

751 GTCCACGCGG AAAATATCGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 

801 GGTGTGCGGC AACGAGTGCG CAGCCGTGGT TCTGCCCGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCCGCCGCCG CGCTGTCTTT GGCTGCAGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 

951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGTGAGGAC GAAGCCGCCG CCATGCACGC CGAAGTCGGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACTVA CAGCGTCGAA 

1201 GCGGCGGAAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

1251 GTTGATTCAA GTGTTGCGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

1301 TGAAAGGTTC GCGCTTTATG CAGATGGAAG AAGTGGTCGA GGCATTGGAG 

1351 GATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 252; ORF 081.a>: 

a081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGGRFDA 

51 HDFVEDVLAA GAAAVWSRE DCVAMDGALK VDDTLTALQM LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDN AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV lEMGMNHFGE LAVLTQIAKP DAALVNNAMR AHVGCGFDGV 

201 GDIAKAKSEI YQGLCSDGMA LIPQEDANMA VFKTATLNLN TRTFGIDSGD 

251 VHAENIVLKP LSCEFDLVCG NECAAWLPV PGRHNVHNAA AAAALSLAAG 

301 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAVDVL 

351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE 

401 AAEKFGADGL WFAAKDPLIQ VLRHDLPERA TVLVKGSRFM QMEEWEALE 

451 DK* 

m08i/a08i 96.7% identity over a 455 aa overlap 

10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 
I I I I i I i I I M I I I I I I I I I I I I I i I I I t I I I I I M I I I I I I I 1 I M I I I I I i I I i I I I 
aOBl MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGGRFDAHDFVEDVLAA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m081 . pep GAAAWVSREDCAAMDGALKVDDTLAALQTLAKAWRENVNPFVFGITGSGGKTTVKEMLA 
I i ) I I I I M I t I : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I 
a081 GAAAVWSREDCVAMDGALKVDDTLTALQMLAKAWRENVNPFVFGITGSGGKTTVKEMLA 

70 80 90 100 110 120 

130 140 150 160 170 180 

mOBl . pep AVLRRRFGDDAVLATAGNFNNHXGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP 
I I I I I I I I I : I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I i I I I I I t I I I I I I I I I 
a081 AVLRRRFGDNAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTQIAKP 
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130 140 150 160 170 180 

190 200 210 220 230 240 

m081 . pep N7VALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGIALIPQEDANMAVFKTATLNLN 
: I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I h I I I I I I I I I I I I I I I I I I I j I 
a081 DAALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGMALIPQEDANMAVFKTATLNLN 

190 200 210 220 230 240 



250 260 270 280 290 300 

m081 . pep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAWLPVPGRHNVHNAAAAAALALAAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I 1 I I : I I I I 
a081 TRTFGIDSGDVHAENIVLKPLSCEFDLVCGNECAAVVLPVPGRHNVHNAAAAAALSLAAG 

250 260 270 280 290 300 



310 320 330 340 350 360 

m081 . pep LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV 
I I I I I I I I I I I I I If I I I I I M I I I I I I I I I I I I I I I I I I I I I I i I : I I I I ] I I I I I I t I 
a081 LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAVDVLARMPAPRIFV 

310 320 330 340 350 360 

370 380 390 400 410 420 

m081 . pep MGDMGELGELGEDEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 
I I I I I I I I I I 1 I I M I I I I I I I I I I I I I I I I I ] I I I I I I I I 1 I I I f I M I I 1 I I I I I 

a081 MGDMGELGE DEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 

370 380 390 400 410 



430 440 450 

mOai .pep LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
aOei LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

420 430 440 450 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 253>: 

g082 .seq 

1 aTGTGGTTGT TGAAGTTGCC TGCCGTCGCC GAAACGGCAT CATCGCCGAA 

51 ACGGCGGCGC AATACCGCAG CCAGCATCTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGGT TCACATTATC GCGCCACGCC 

151 TTCGCCAACG TTTGCAACGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 

201 GCCCAAAGCC GCGCAATCTT CGCGCGAAAC CACAACCGCC GCCGCGCCCG 

251 CAGACAATAC GCCTCCAACA AAATCATGCG CGTCAAACCG CCCGCCCGCC 

301 AATGCGAAAA ACACATCGCC TTCCCGAATA TCGCGGCTGT CGGTTACGAT 

351 6CGCGACACG GGTTTGTTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTGCTTTCGT TAATATTCGG 

451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGCCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGACT GTCTGTCGAT AAAGGCAAGG TTATTGCCTT 

651 CGCCCGGCAC ATCGGGGACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 254; ORF 082.ng>: 
g082 .pep 

1 MWLLKLPAVA ETASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTLSRHA 

51 FANVCNAASV SSTFNAPPKA AQSSRETTTA AAPADNTPPT KSCASNRPPA 

101 NAKNTSPSRI SRLSVTMRDT GLFSDGIGSL RAWQMKFRSS GFIFAFVNIR 

.151 AADTSVAADF FIACFAW KH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGLSVD KGKVIAFARH IGDIPPKIIA VIGQLVGFDT RPTAESA* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 255>: 

m0a2 . seq 

1 ATGnnGTTGT TGAAGTTGCC TGCCGTCGCC AACACGGCAT CATCGCCGAA 

51 ACGGcGGCGC AATACCGCAG CCAGCATTTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGAT TCACATTTTC ACGCCACGCC 

151 TTTGCCAGCG TTTGCAATGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 
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201 ATCCATTGCA GCACAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCAG 

251 CAGCCAATAC GTCTTCAACA AAATCATGCG CGTCAAACCG CTCGCCCGCC 

301 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT 

351 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTACTTTCGT TAATATTCGG 

451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGsATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 

651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 256; ORF 082>: 

m082.pep 

1 MXLLKLPAVA NTASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTFSRHA 

51 FASVCNAASV SSTFNAPSIA AQSSRETTTA AAPAANTSST KSCASNRSPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSXF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 082 shows 92.7% identity over a 247 aa overlap with a predicted ORF (ORF 082.ng) 
from N. gonorrhoeae: 

m082/9082 

10 20 30 40 50 60 

MXLLKLPAVANTASSPKRRRNTAASISFTWLPPEPVMPNTNGFTFSRHAFASVCNAASV 

I IIIMIIhllllllllllllllllllMIIIIIIMIIIIIhlllllhlllllll 

MWLLKLPAVAETASSPKRRRNTAASISFTVVLPPEPVMPNTNGFTLSRHAFANVCNAASV 
10 20 30 40 50 60 



m082 .pep 
g082 



70 80 90 100 110 120 

m082 .pep SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 

lllllll IIIIIIMIIIIIII II Illlllll llllllllh|:||||||||l| 
g082 SSTFNAPPKAAQSSRETTTAAAPADNTPPTKSCASNRPPANAKNTSPSRISRLSVTMRDT 
70 80 90 100 110 120 



130 140 150 160 170 180 

m082 .pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRA7UDTSVAADFFIACFAWKHRLFSHSHSXF 

IhllllllllllllllllllllhlllllllllllllllllllllllllllllMII I 

g082 GLFSDGIGSLRAWQMKFRSSGFIFAFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 
130 140 150 160 170 180 



190 200 210 220 230 240 

m082 . pep FLYVSFFRRIFSRFAFSRIPRRGWGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT 

Illlllllllllllllllllllllll lllllllllll llhlllllllllllllllll 
g082 FLYVSFFRRIFSRFAFSRIPRRGWGLSVDKGKVIAFARHIGDIPPKIIAVIGQLVGFDT 
190 200 210 220 230 240 



m082.pep RPTAESAX 

Illlllll 
go 8 2 RPTAESAX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 257>: 

a082 . eeq 

1 ATGTGGTTGT TGAAGTTGCC TGCCGTCGCC AAAACGGCAT TATCGCCGAA 
51 ACGGCGGCGC AATACCGCAG CCAACATTTC CTTCACCGTC GTCTTGCCGC 
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101 CCGAGCCGGT AATACCGAAC 

151 TTCGCCAACA TTTGCAACGC 

201 ATCCATTGCA ACGCAATCTT 

251 CAGCCAATAC GTCTTCAACA 

301 AATGCGAAAA ACACATCGCC 

351 GCGCGACACG GGTTTGCTTT 

401 AGATGAAATT TAGGTCCAGT 

451 GCGGCGGACA CATCGGTAGC 

501 GGTATU^CAC AGATTATTTT 

551 TATCATTTTT TAGACGTATT 

601 CGGCGCGGGG TCGTCGGGCA 

651 CGCCCTGCAC ATCGGGAACA 

701 AACTCGTCGG TTTCGATACC 



ACAAACGGGT TCACATTCTC GCGCCACGCC 
GGTAAGCGTG TCATCGACTT TCAACGCGCC 
CGCGCGAAAC CACAACCGCC GCCGCACCCG 
AAATCATGCG CATCAAACCG CCCGCCCGCC 
CGCGCGGATG TCGCGGCTGT CGGTTACGAT 
CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 
GGTTTCATAT TTACTTTCGT TAATATTCGG 
GGCTGATTTT TTTATCGCCT GTTTTGCTGT 
CCCATTCTCA TTCGGCATTT TTTCTGTACG 
TTTAGTCGAT TTGCCTTTTC CCGCATACCA 
GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 
TTCCCCCAAA AATCATAGCC GTCATCGGGC 
CGTCCAACTG CCGAATCCGC GTM 



This corresponds to the amino acid sequence <SEQ ID 258; ORF 082.a>: 

a082.pep 

1 MWLLKLPAVA KTALSPKRRR NTAANISFTV VLPPEPVIPN TNGFTFSRHA 

51 FANICNAVSV SSTFNAPSIA TQSSRETTTA AAPAANTSST KSCASNRPPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 



m082/a082 95.5% identity over a 247 aa overlap 



10 20 30 40 50 60 

in082 . pep MXLLKLPAVANTASSPKRRRNTAASISFTWLPPEPVMPNTNGFTFSRHAFASVCNAASV 
I 11111111:11 I I I I I I I I I I : I I I I I I I I I I i I : I I I I I M I I I I I I I :: I 11 : I I 
a082 MWLLKLPAVAKTALSPKRRRNTAANISFTWLPPEPVIPNTNGFTFSRHAFANICNAVSV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m082 . pep SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 
I i 1 I I I I I I I : I I I ] I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i i I I I I I I I I I I I I 
a082 SSTFNAPSIATQSSRETTTAAAPAANTSSTKSCASNRPPANAKNTSPARMSRLSVTMRDT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m082.pep GLLSDGIGSLRAWQMKFRSSGFIETFVNIRAADTSVAADFFIACFAWKHRLFSHSHSXF 
I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a082 GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 

130 140 150 160 170 180 



190 200 210 220 230 240 

in082.pep FLYVSFFRRI FSRFAFSRIPRRGWGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT 

I i I t I I I I I t I I I t ) I t I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 
a082 FLYVSFFRRIFSRFAFSRIPRRGWGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT 

190 200 210 220 230 240 



m082,pep RPTAESAX 
I I I I I I I I 
a082 RPTAESAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 259>: 

g084 . seq 

1 ATGAAacaAT CCGcccgaat aAAAAATATG GATCAGACAT TAAAAAATAc 

51 attgggcatt tGCGCGcttt tagcctTTTG TTTTggcgcG gccaXCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGC 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GCTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCTT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 
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401 TTTGGAAATA TTGTGTATCT GTGGGGGTAT TTGCTGACGT AAAAAACTAT 

4 51 AAACGTCGCA GCAAAATATG GCTGACCATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGAGAAAA TCGccggcga taaAGATTGG CGAGaacctg 

551 atgccggcct gttgttgaat ATTTTcgacc tgtattaCga cttggctttc 

601 cgcgccggca cAATATGCCG CCAAGCGCGC CCAcattttg gaagCagcaa 

651 aaaaagcgtC AACATGGCAt atccgccaac ttgcgcccaa gTAtaa 



This corresponds to the amino acid sequence <SEQ ID 260; ORF 084.ng>: 

g084 .pep 

1 MKQSARI KNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYR YSAVG 
51 ALASWFLLL LARGFPRVSS WLLIYVGTT ALYLPVGWLY GAPSYQIVGS 



101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTVW KYCVS VGVFADVKNY 
151 KRRSKIWLTI LLTLILSCAV MEKIAGDKDW REPDAGLLLN IFDLYYDIiAF 



201 RAGTICRQAR PHFGSSKKSV NMAYPPTCAQ V* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 26l>: 



1 ATGAAACAAT CCGCCcGAAT AAAa.ATATG AATCAGACAT TACTTTATAC 

51 ATTGGGCATT TGCGCGCTTT TAACCTTTnn nnnnnnnnnn nnnnnnnnnn 

101 nnnnnTATCA CCCnGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

401 TTTGGAAATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

451 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCT.TC 

601 CGCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 



This corresponds to the amino acid sequence <SEQ ID 262; ORF 084>: 

m084 .pep 

1 MKQSARIKXM NQTLLYTLGI CALLTFXXXX XXXXX YHPEY EYGY RYSAVG 
51 ALASWFLLL LAR GFP RVSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 



101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTVW KYCVS GGVFADVKNY 
151 KRRS KIWLTI LLTLILSCAV MD KIASDKDL REPDAGLLLN IFDLYYDLAX 
201 RAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 084 shows 90.5% identity over a 231 aa overlap with a predicted ORF (ORF 084.ng) 
from A'', gonorrhoeae: 



ni084 . seq 



m084/g084 



m084 .pep 



10 20 30 40 50 

MKQSARI KXMNQTLLYTLGI CALLTF YHPE YE YGYRYS AVGALAS WFLLL 



llllllll hill lllllllhl 




g084 



MKQSARI KNMDQTLKNTLG I CALLAFCFGAAI ASGYHLE YE YGYRYS AVGALAS WFLLL 



10 20 30 40 50 60 



m084 .pep 



60 70 80 90 100 110 

LARGFPRVSSWLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 



g084 




70 80 90 100 110 120 



m084 .pep 



120 130 140 150 160 170 

YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 



g084 




130 140 150 160 170 180 
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180 190 200 210 220 

m084 . pep REPDAGLLLNIFDLYYDLAXRAGTICRQARPHFGSSKKSVNMAYPSCCAQVX 

iiiiiiiiiiiiiiiiiii iiiiiiiiiiiiiiiiiiiiiiiii mil 

g084 REPDAGLLLNIFDLYYDLAFRAGTICRQARPHFGSSKKSVNMAYPPTCAQVX 
190 200 210 220 230 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 263>: 

a084 . seq 

1 ATGAAACAAT CCGCCCGAAT AAAAAATATG GATCAGACAT TAAAAAATAC 

51 ATTGGGCATT TGCGCGCTTT TAGCCTTTTG TTTTGGCGCG GCCATCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

401 TTTGGAGATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

4 51 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCTTCC 

601 .GCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 264; ORF 084.a>: 

a084.pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYRYSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTVW RYCVS GGVFADVKNY 

151 KRRSK IWLTI LLTLILSCAV MD KIASDKDL REPDAGLLLN IFDLYYDLAS 

201 XAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 

m084/a084 92.2% identity over a 231 aa overlap 

10 20 30 40 50 60 

m084 . pep MKQSARIKXMNQTLLYTLGICALLTFXXXXXXXXXYHPEYEYGYRYSAVGALASWFLLL 
111111111:111 I I I I i I I I : I I I I I I I M I I I I I j I I I I 1 I I I I I 

a084 MKQSARIKNMDQTLKNTLGICALLAFCFGAAIASGYHLEYEYGYRYSAVGALASWFLLL 
10 20 30 40 50 60 

70 80 90 100 110 120 

ra084 . pep LARGFPRVSSWLLI YVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I i I I I I I I I I I I i I ! I I 
a084 LARGFPRVSSWLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m084 . pep YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 
I I M I I I I I I i I I I i : I I I I I I I I I I I I I I I I I I 1! I I I I I I I I i I I I I n I I I I I I 1 I I 
a084 YFVQALFFIFGLTVWRYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 

130 140 150 160 170 180 

190 200 210 220 230 

in084 . pep REPDAGLLLNIFDLYYDLAXRAGTICRQARPHFGSSKKSVNMAYPSCCAQVX 
I I I I I I I M I I I I I I I I I I I I I I i I I I I i I I I I I I I I I [ I I I I I I I I I I I 
a084 REPDAGLLLNIFDLYYDLASXAGTICRQARPHFGSSKKSVNMAYPSCCAQVX 

190 200 210 220 230 

The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 265>: 

gOas.seg 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGT TGAAAGATAA 
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51 GGCAAAAGGC GTGTTCCTGA TCGGCGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AACCTGACCG ACTGCGTCAC TTTGGAAGAG 

151 GCGGTTCAGA CX3GCATACGC CCAAGCCGAA GCGGGCGATA TTGTCTTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT tatCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 266; ORF 085.ng>: 

g085 .pep 

1 MGKGQDFTPL RDALKDKAKG VFLIGVDAPQ IRRDLDGCGL NLTDCVTLEE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 267>: 

mOSS . seq 

1 ATGGGTAAAG GGCAGGACTT CACGCCCCTG CGCGATGCAC TGGTAGGCAA 

51 GGCAAAAGGC GTGTTCTTGA TTGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AATATGACCG ACTGCGCCAC TTTGGGAGAA 

151 GCCGTTCAGA CGGCATATGC CCAAGCCGAA GCAGGCGATA TTGTGTTGCT 

201 CAGCCCCGCC TGCGCGAGCT TTGATATGTT CAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT TATCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 268; ORF 085>: 

m085.pap 

1 MGKGQDFTPL RDALVGKAKG VFLIGVDAPQ IRRDLDGCGL NMTDCATLGE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

Computer analysis of this amino acid sequence gave the following resuhs: 
Homology with a predicted ORF from A^. ^gonorrhoeae 

ORF 085 shows 94.7% identity over a 94 aa overlap with a predicted ORF (ORF 085.ng) 
firom N. gonorrhoeae: 

m085/g085 



m085 .pep 
g085 



m085 .pep 
g085 

70 80 90 



10 20 30 40 50 60 

MGKGQDFTPLRDALVGKAKGVFLIGVDAPQIRRDLDGCGLNMTDCATLGEAVQTAYAQAE 
llllllilllllll llllllllllllllllllllllllhllhll lllllllllll 
MGKGQDFTPLRDALKDKAKGVFLIGVDAPQIRRDLDGCGLNLTDCVTLEEAVQTAYAQAE 

10 20 30 40 50 60 

70 80 90 

AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 
AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX 



The following partial DNA sequence was identified iniV^ meningitidis <SEQ ID 269>: 

a085 . seq 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGC TTGCCGGCAA 

51 GGCAAAAGGC GTGTTCCTGA TCGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGATCTG AATATGACCG ACTGCGCCAC TTTGGAAGAA 

151 GCGGTTCAGA AGGCATATGC CCAAGCCGAA GCGGGCGATA TCGTGCTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT TATCGGGGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 270; ORF 085.a>: 

aOBS.pep 

1 MGKGQDFTPL RDALAGKAKG VFLIGVDAPQ IRRDLDGCDL NMTDCATLEE 
51 AVQKAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIGA FKAL* 

m085/a085 94.7% identity over a 94 aa overlap 



10 20 30 40 50 60 

Itl08 5 . pep MGKGQDFTPLRDALVGKAKGVFLIGVDAPQIRRDLDGCGLNMTDCATLGEAVQTAYAQAE 
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1 1 1 1 1 1 1 1 i 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 nil 1 1 1 1 1 1 

a085 MGKGQDFTPLRDALAGKAKGVFLIGVDAPQIRRDLDGCDLNMTDCATLEEAVQKAYAQAE 
10 20 30 40 50 60 

70 80 90 

mOBS .pep AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX 
I I I I I I t I i I I I I I I I I I I I I I i I I I I I I I I I I I 
a085 AGDIVLLSPACASFDMFKGYAHRSEVFIGAFKALX 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 271>: 

gOB6 .seq 

1 ATGGTGGTGC TGATGACGGC GTTCGGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCGAAGG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GGCAGGCGGG GTTCGTCGTT GCCGGCCTTA TAGCGAGCGG TTTTTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CTTATCCGGC CTGTTGCTGG TAGCCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

3 01 GAGCTGTTCA AGCTGGCAGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

3 51 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

401 GGCGGGGGAC GGCCAACCTG ATTATGTCCG CCACCAATCC GCAGGCACGT 

451 CGTGAAACAT TAGAAATGTA CGgcCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTACCG TTGGAATGCT GTTTCTGGCA 

601 GGATTGCCGT GGAAATATTT TTTTGTCCTG GTAGGCAGCG TCTTGGGTGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCTGAA GAATTCGGCT TCTTCGGGAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGtttgA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGctgCcg tTGATGTCCT ATGGcggTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTATCGATT 

1151 ATGAAAACCG CCAGAAAATG CGCGGTTACC GGGTGGAGTA AA 

This corresponds to the amino acid sequence <SEQ ID 272; ORF 086.ng>; 

g086 .pep 

1 MWLMTAFGL LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGFLW 

51 FLCRMRTWRR LVPWIFALSG LLLVAVLIA G REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF R AIILPIMLV AFGLVLIMV Q PDFGS FWIT VITVGMLFLA 

201 GLPWKY FFVL VGSVLGGMVL MITA APYRVQ RWAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 

301 IFCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SF FNIGVNIG 

351 ALPTKGLTL P LM5Y GGSSVF FMLISMMLLL RI DYENRQKM RGYRVE* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 273>: 

m086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACc 

301 GAGCTGTTCA AGCtGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGcGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

401 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGrCACGT 

451 CGTGAaACAT TAGAAATGTA CGGCCGTwTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 
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551 GTTCGTTTGT CX3TCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGC7VG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGkrTCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA mCAAAgGyCT GACGCyGCCG Tg.AtGTCCw ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTkG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTATC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ E) 274; ORF 086>: 

mOB6 .pep 

1 MWLMTAFSL LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGLLW 

51 FLCRMRTWRR L VPWIFALSG LLLVWLIA G REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQXR 

151 RETLEMYGRX R AIILPIMLV AFGLVLIMV Q PDFGS FWIT VIAVGMLFLA 

201 GLPW KYFFVL VGSVIiGGMVL MITA APYRVQ RWAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 

301 IFCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGXQ SF FNIGVNIG 

351 ALPXKGLTX P XMSX GGSSVF FMLI5MMLLX RI DYENRRKM RGYRVE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 086 shows 96.7% identity over a 396 aa overlap with a predicted ORF (ORF 086.ng) 
from N, gonorrhoeae: 

mOB6/gOB6 

10 20 30 40 50 60 

MVVLmAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR 

lllllllhlllllllllllllllllllilllllllllllllllllhllllllllllll 
MVVLMTAFGLLMIYSASVYIiASKEGGDQFFYLTRQAGFVVAGLIASGFLWFLCRMRTWRR 
10 20 30 40 50 60 

70 80 90 100 110 120 

LVPWI FALSGLLLVWLIAGREXNGATRWI PLGPLNFQPTELFKLAVILYLASLFTRREE 

IIMIIIIIIMIhllllllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

LVPWI FALSGLLLVAVLIAGREINGATRWI PLGPLNFQPTELFKIiAVILYLASLFTRREE 
70 80 90 100 110 120 

130 140 150 160 170 180 

VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIIIiPIMLVAFGLVLIMVQ 

Mllllllllllllllllllllllllll llllllllll llllllllilllllllMII 

VLRSMESLGWQSIWRGTANLIMSATNPQARRETIiEMYGRFRAIILPIMLVAFGLVLIMVQ 
130 140 150 160 170 180 

190 200 210 220 230 240 

PDFGSFWITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 

IIIIIMIIIIhlllllllllllllllllllllllillMIIIIIIIIIIIIIIIIIII 

PDFGSFWITVITVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 
190 200 210 220 230 240 

250 260 270 280 290 300 

DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFI FAX I AEEFGFFGMCVL 

lllllllllllllllllllllllillllllllllllllllllllllllllllllllllll 
DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 
250 260 270 280 290 300 



m086 .pep 
g086 

m086 .pep 
g086 

m086 .pep 
g086 

m086 .pep 
g086 

m086 .pep 
g086 



310 320 330 340 350 360 
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m086 . pep IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 

IIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII lllllllllllll|:|||| I 
g086 IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP 
310 320 330 340 350 360 



370 380 390 

mO 8 6 . pep XMSXGGSSVFFMLI SMMLLXRIDYENRRKMRGYRVEX 

II lllllllllllllll MllllhllllMIII 

g 0 8 6 LxMS YGGSSVFFMLI SMMLLLRIDYENRQKMRGYRVEX 

370 380 390 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 275>: 

a086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGMTTT CCAGCCGACC 

301 GAGCTGTTCA AGCTGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

401 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGGCACGT 

451 CGTGAAACAT TAGAAATGTA CGGCCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGCTGCCG TTGATGTCCT ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTACC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ E) 276; ORF 086.a>: 

a086.pep 

1 MWLMTAF5L LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGLLW 

51 FLCRMRTWRR L VPWIFALSG LLLVWLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF R AIILPIMLV AFGLVLIMV Q PDFGS FWIT VIAVGMLFLA 

201 GLPWKY FFVL VGSVLGGMVL MIT AAPYRVQ RWAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE E FGFFGMCVL 

301 IFCYGWLVV R AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SFFNIGVNIG 

351 ALPTKGLTLP LMSYGGSSVF FMLISMMLLL RIDYENRRKM RGYRVE* 



m086/a086 98.0% identity over a 396 aa overlap 



10 20 30 40 50 60 

m08 6 . pep MWLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR 
I I I i I I I I I I I I I I i I I I I I I I I i I I I I I I I I I I I I i I I I I I I t I I I I I t I [ i M I I I I I 
a08 6 MWLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFWAGLIASGLLWFLCRMRTWRR 

10 20 30 40 50 60 



70 80 90 100 110 120 

inOee.pep LVPWIFALSGLLLWVLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE 
M I I I I I I I t I I I I I I I I I I I I I ) M I I I I I I I I I I I I I I I I I j I I I I M I I I I I t I I I I 
a08 6 LVPWIFALSGLLLVWLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE 
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70 80 90 100 110 120 

130 140 150 160 170 180 

m086 . pep VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIILPIMLVAFX3LVLIMVQ 
I I I I I I I I i I i I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I 
a086 VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m086 . pep PDFGSEWITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 
I I I I I I I i I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I i I I I I I I I I I I I I I i I I I I I I I 
a086 PDFGSFWITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m086 . pep DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 
I I I I I I I i i I I I I I I I I I I I I I ) I I I I I M I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I 
a086 DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 
250 260 270 280 290 300 

310 320 330 340 350 360 

m086.pep IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 
I j I I I I I I I t I I I I I I I I I I I I I t I I I I I I I I I t I I I I I I 1 j I j I t I I I I I I : I I I I i 
a086 IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP 

310 320 330 340 350 360 

370 380 390 

mO 8 6 . pep XMSXGGSS VFFTILISMMLLXRI DYENRRKMRGYRVEX 

II I I I I I t I i I t I I I I I I I I I I I I I I I I I I I I I I 
a086 LMSYGGSSVFFMLISMMLLLRIDYENRRKMRGYRVEX 

370 380 390 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 277>: 

g0B7.seq 

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACACAT 

51 TTTCCCAGCT CTGGCTGTGG CGGATTCATT GCGCGTGCGC GGTCATCATG 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA 

151 TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGAATAC GCGGCAACGG 

201 CATCAAACGC AAGCTGATGC TTCCGTTTAC TCTGTACAAA ACCGTCCGCG 

251 AAGCGCAGCG GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGTTTTG TTACCTTTCC CGGCGGTCTG GCGGCGAAAC TCTTGGGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGCTTG TCCAACCGCC 

401 AccTGTCGCg ctGGGCGAAA CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

451 AGCCACGAAG GCGGTTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGCGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGTGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCGT TGGCACTGCT GCCTGAAGAG GTGCGCCCGC AGATGTACCA 

651 CCAGTCGGGG CGTAACAAGC TGGGCAATCT TCAGGCGGAT TATGACGCGT 

701 TGGGCGTGAA AGCGGAATGC GTGGAATTTA TTACCGACAT GGTGTCCGCC 

751 TACCGTGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC 

801 CGAGTTGACG GCGGCGGGGC TGGGCGCGTT GTTAGTGCCG TATCCTCACG 

851 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTCATGGT GCAGGCAGAA 

901 GCGGGGCTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 

951 AATCCTCGGC AGCCTCAACC GCGAAAAATG CCTCAAATGG GCGGAAAACG 

1001 CCCGTACGTT GGCATTGCCG CACAGCGCGG ATGACGTTGC CGAAGCCGCG 

1051 ATTGCGTGTG CGGCGTAAA 

This corresponds to the amino acid sequence <SEQ ID 278; ORF 087.ng>: 

9087. pep 

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRVR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGIRGNGIKR KLMLPFTLYK TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 
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201 PQALALLPEE VRPQMYHQSG RNKLGNLQAD YDALGVKAEC VEFITDMVSA 
251 YRDADLVICR A GALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 
301 AGLLLPQTQL TAEKLAEILG SLNREKCLKW AENARTLALP HSADDVAEAA 
351 lACAA* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 279>: 

m087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGAwkkCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GGCCATCATG 

101 TGATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGTAT CGTGCCGCAA 

151 TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

201 CATCAAACGC AAACTGATGC TGCCGGTTAC TTTGTATCAA ACCGTCCGCG 

251 AAGCGCAGCG GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGCTTCG TTACCTTCCC CGGCX5GTTTG GCGGCGAAGC TATTArGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 

4 01 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

4 51 AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGTGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCATGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC ATATGTACCA 

651 CCAATCGGGA CGGGGCAAGC TGGGCATCTT GCAGGCGnnn nnnnnnnnnn 

701 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

751 nnnGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG CGGTTGACGA 

801 TCACCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG GCGGGATTGC 

851 TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA GATTCTCGGC 

901 GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG CCCGTACGTT. 

951 GGCACTGCCG CACAGTGCGG ACGACGTGGC GGAAGCCGCG ATTGCGTGTG 

1001 CGGCGTAA 

This corresponds to the amino acid sequence <SEQ ED 280; ORF 087>: 

m087 .pep 

1 MGGKTFMLXX GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGVRGNGIKR KLMLPVTLYQ TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLXVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

201 PHALALLPDN ARPHMYHQSG RGKLGILQAX XXXXXXXXXX XXXXXXXXXX 

251 XAGLGALLV P YPHAVDDHQT ANARFMVQAE AGLLLPQTQL TAEKLAEILG 

301 GLNREKCLKW AENARTLALP HSADDVAEAA lACAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 087 shows 83.9% identity over a 355 aa overlap with a predicted ORF (ORF 087.ng) 
from N. gonorrhoeae: 

m087/g087 

10 20 30 40 50 60 

MGGKTFMLXXGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI 

Illlllll iiiiiiiiiiiiiMiihiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

MGGKTFMLMAGGTGGHIFPALAVADSLRVRGHHVIWLGSKDSMEERIVPQYGIRLETLAI 
10 20 30 40 50 60 

70 80 90 100 110 120 

KGVRGNGI KRKLMLPVTLYQTVREAQRII RKHRVECVIGFGGFVTFPGGLAAKLLXVP I V 

Ihllllllllllll illHIIIIIIIillllllMIIIIIIIIIIIIMIIII INI 

KGIRGNGIKRKLMLPFTLYKTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLGVPIV 
70 80 90 100 110 120 

130 140 150 160 170 180 

IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 

lllllllllllllllllllllllllllllllllilMIMIIIIIIIIMIIIIIIIIII 

IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 
130 140 150 160 170 180 



m087 .pep 
g087 

m087 .pep 
g087 

m087 .pep 
g087 
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190 200 210 220 229 
RLKILWGGSLGADVI^KTVPHALALLPDNARPHMYHQSGRGKLGIIiQA- 

IIIIIIIIIIIIIIIMIIIhlllllh::|hllllllhlll III 
RLKILWGGSUSADVLNKTVPQALALLPEEVRPQMYHQSGRNKLGNLQADYDAIiGVKAEC 
190 200 210 220 230 240 

230 240 250 
AGLGALLVPYPHAVDDHQTANARFMVQAE 

IIIIIIIIIIIIIIIIIIIIIIIMIIII 
VEFITDMVSAYRDADLVICRAGALTIAELTAAGLGALLVPYPHAVDDHQTANARFMVQAE 
250 260 270 280 290 300 

260 270 280 290 300 310 

AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 

I II M II II II I II II I M h 1 1 1 1 II II I II I II II I II III M II 1 1 1 II II II 

AGLLLPQTQLTAEKIiAEILGSLNREKCLKWAENARTLALPHSADDVAKAAIACAAX 
310 320 330 340 350 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 28 1>: 

a087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG 

151 TACGACATCC TGCTCGAAAC GCTGGCGATT 

201 CATCAAACGC AAGCTGATGC TGCCGTTTAC 

251 AAGCGCAGCA GATTATCCGC AAACACCGTG 

301 GGCGGCTTCG TTACCTTTCC CGGCGGTTTG 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT 

401 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT 

4 51 AGCCACGAAG GCGGCTTGGT CGGCAACCCC 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG 

551 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG 

601 CCGCAGGCAT TGGCTTTGCT GCCCGACAAT 

651 CCAATCGGGA CGGGGCAAGC TGGGCAGCTT 

701 TGGGCGTGCA AGCGGAATGC GTGGAATTTA 

751 TACCGCGATG CCGATTTGGT GATTTGCCGT 

801 CGAGTTGACG GCGGCGGGAT TGGGTGCGTT 

851 CCGTTGATGA CCATCAAACC GCCAACGCGC 

901 GCGGGATTGC TGTTGCCGCA AACCCAGTTG 

951 GATTCTCGGC GGCTTAAACC GCGAAAAATG 

1001 CCCGTACGTT GGCACTGCCG CACAGTGCGG 

1051 ATTGCGTGTG CGGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 282; ORF 087,a>: 

a087 .pep 

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YDILLETLAI KGVRGNGIKR KLMLPFTLYQ TVREAQQIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

201 PQALALLPDN ARPQMYHQSG RGKLGSLQAD YDALGVQAEC VEFITDMVSA 

251 YRDADLVICR AG ALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 

301 AGLLLPQTQL TAEKLAEILG GLNREKCLKW AENARTLALP HSADDVAEAA 

351 lACAA* 

in087/a087 85.4% identity over a 355 aa overlap 

10 20 30 40 50 60 

m087 . pep MGGKTFMLXXGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI 
i I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
a087 MGGKTFMLMAGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYDILLETLAI 

10 20 30 40 50 60 



m087 .pep 
g087 

m087 .pep 
g087 

m087 .pep 
g087 



GGCGGAACGG 
GCGCGCGCGC 
AAGAGCGCAT 
AAAGGCGTGC 
TTTGTATCAA 
TCGAGTGCGT 
GCGGCGAAGT 
GGCAGGTTTG 
ACGCTTTTCC 
GTCCGCGCCG 
GCGTGAAGGC 
ACGTTTTGAA 
GCGCGTCCGC 
GCAGGCGGAT 
TTACCGATAT 
GCCGGCGCGC 
GTTAGTGCCG 
GTTTTATGGT 
ACGGCGG7UUV 
CCTCAAATGG 
ACGACGTTGC 



GCGGACATAT 
GGCCATCATG 
CGTGCCGCAA 
GCGGCAACGG 
ACTGTCCGCG 
CATCGGCTTC 
TATTAGGCGT 
TCCAACCGCC 
GAAAGCGTTC 
ATATTAGCTU^ 
CGTCTGAAAA 
CAAAACCGTA 
AGATGTACCA 
TACGACGCGC 
GGTGTCCGCC 
TGACGATTGC 
TATCCTCACG 
GCAGGCGGAG 
AACTCGCCGA 
GCAGAAAACG 
CGAAGCCGCG 
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70 80 90 100 110 120 

KGVRGNGIKRKLMLPVTLYQTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLXVPIV 
I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I M I i I I I I I I I I I I I I I I I M I I I I 

kgvrgngikrklmlpftlyqtvreaqqiirkhrvecvigfx;gfvtfpggiaakllgvpiv 

70 80 90 100 110 120 

130 140 150 160 170 180 

iheqnavaglsnrhlsrwakrvlyafpkafshegglvgnpvradisnlpvpaerfqgreg 
1 1 1 1 1 1 1 1 1 1 1 1 j I i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 i 1 1 1 1 1 1 1 1 
iheqnavaglsnrhlsrwakrvlyafpkafshegglvgnpvradisnlpvpaerfqgreg 

130 140 150 160 170 180 

190 200 210 220 230 240 

rlkilvvggslgadvlnktvphalallpdnarphmyhqsgrgklgilqaxxxxxxxxxxx 

I I I I I I I I I I I I I I I I I I I ) I : i I I I I I I I I I I : I t I I I I i I I I I III 

rlkilvvggslgadvlnktvpqalallpdnarpqmyhqsgrgklgslqadydalgvqaec 

190 200 210 220 230 240 

250 260 270 280 

XX xxxxxxxxxaglgallvpyphavddhqtanarfmvqae 

: : I I I ) I I I II I I I I I I I II i II I I t I I I I I 

vefitdmvsayrdadlvicragaltiaeltaaglgallvpyphavddhqtanarfmvqae 

250 260 270 280 290 300 

290 300 310 320 330 

aglllpqtqltaeklaeilgglnrekclkwaenartlalphsaddvaeaaiacaax. 
I I I i II I I I M I I I M I I II I I I I I I II I I I I I I I I I I I II 11 I I I I I I I I M I I I 
aglllpqtqltaeklaeilgglnrekclkwaenartlalphsaddvaeaaiacaax 

310 320 330 340 350 

The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 283>: 

g088 . seq 

1 atgtttttat ggctcgcaca tttcagcaac tggttaaccg gtctgaatat 

51 ttttcaatac accacattcc gcgccgttat ggcggcgttg accgccttgg 

101 cgttttccct gatgttcggc ccgtggacga tacgcaggct gaccgcgctc 

151 aaatgcgggc aggcagtgcg taccgacggc ccgcaaaccc acctcgtcaa 

201 aaacggcacg ccgacgatgg gcggttcgct gattctgacc gccattaccg 

251 tgtccaccct gttgtggggc aactgggcga acccgtatat ctggattctc 

301 ttgggcgtac tgcttgccac cggtgcgctc ggtttttacg acgactggcg 

351 caaagtcgtt tataaagacc ccaacggcgt gtccgccaaa ttcaaaatgg 

401 TGTGGCAGTC AAGCGTTGCC GTTatcgcCG GTttggcaTT GTTTTACctt 

451 gCcgcCAATT CCGCCAACAA TATTTTGATT GTCCCGtttT TCAAACAAAT 

501 CGCCCTGCCG CTGGGCGTGG TCGGCTTttt gGtgttgTCT TACCTGACCA 

551 TCGTCGGCAC ATCCAACGCC GTCAACCTCA CcgaCGGCTT GGACGGCCTT 

601 GCCGCcttcc cgttcgtcct cgttgccgcC GGGCTCGCCA ttttcgccTA 

651 CGTCAGCGGA CACTACCAAT TTTCCCAATA CCTCCAGCTT CCCTATGTCG 

701 CCGGCGCGAA CGAAGTCGCT ATATTCTGCA CCGCCATGTG CGGCGCGTGC 

751 CTCGGATTTT TGTGGTTCAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 

801 TGTCGGCGCG CTGGCATTGG GTGCCGCGCT CGGTaccGtt gCCGTcaTcg 

851 tCCGCCAAGA ATTTGTcctc gtcattaTGG GCGGTCTGrT cgtcgtagaa 

901 gccgtgTCCG TTATGCTTCa tgtcggCTGG TACAAGAAAA Ccaaaaaacg 

951 CATCTTcCTg acgGcaccga ttcatcacca ttaCCaactt cgatgCTGGa 

1001 aagaaacgca agtcgtcgtc CGTTtCTGGA TTAtTAccat cgtcgtggtt 

1051 tTgataggtt tGagtacccT caAAattcgc ggaaactatg ccgTCCGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 284; ORF 088.ng>: 

g088 .pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAF5LMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSANNILI VPFFKQIALP I1GWGFI.VLS YLTIVGTSNA VNLTDGLDGL 



m087 .pep 
a087 

m087 .pep 
a087 

m087 .pep 
a087 

m087 .pep 
a087 

m087 .pep 
a087 
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201 AAFPFVLVAA GLAIFAY VSG HYQFSQYLQL PYVAGAN EVA IFCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

3 01 AVSVMLHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIWV 

351 LIGL STLKIR GNYAVRTPFR RHLNAQ* 

The following partial DNA sequence was identified in ^meningitidis <SEQ ID 285>: 
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451 
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501 


nnnnnnnnnn 


nnnGGCGTGG 


TCGGCTTTTT 


GGTGTTGTCT 


TACCTGACCA 


551 


TCGTCGGCAC 


ATCCAATGCC 


GTCAACCTCA 


CCGACGGCTT 


GGACGGCCTT 


601 


GCGACCTTCC 


CCGTCGTCCT 


CGTTGCCGCC 


GGCCTCGCCA 


TCTTCGCCTA 


651 


TGCCAGCGGC 


CACTCACAAT 


TTGCCCAATA 


CCTGCAATTA 


CCTTACGTTG 


701 


CCGGCGCAAA 


CGAAGTGGTG 


ATTTTCTGTA 


CCGCCATGTG 


CGGCGCGTGC 


751 


CTCGGTTTCT 


TGTGGTTTAA 


CGCCTATCCC 


GCGCAAGTCT 


TTATGGGCGA 


801 


TGTCGGTGCA 


TTGGCATTGG 


GTGCCGCGCT 


CGGTACCGTC 


GCCGTTATCG 


851 


TCCGCCAAGA 


GTTTGTCCTC 


GTCATTATGG 


GCGGATTATT 


TGTCGTAGAA 


901 


GCCGTATCCG 


TTATGCTTCA 


GGTTGGCTGG 


TATAAGAAAA 


CCAAAAAACG 


951 


CATCTTCCTG 


ATGGCGCCCA 


TCCATCACCA 


CTACGAACAA 


AAAGGCTGGA 


1001 


AAGAAACCCA 


AGTCGTCGTC 


CGCTTTTGGA 


TTATTACCAT 


CGTCTTGGTG 


1051 


TTGATCGGTT 


TGAGTACCCT 


CAAAATCCGC 


TGAACCTATG 


CCGTCTGAAC 


1101 


ATCTTTCAGA 


CGGCATTTGA 


ACGCGCAATA 


A 





1 MFLWLAHFSN WLTGLNIFQY T TFRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCX3QAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLLW G NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSAN NILI VPFFKQI ALP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 AAFPFVLVAA GLAIFAY VSG HYQFSQYLQL PYVAGANE VA IFCTAMCGAC 

251 LGFLW FNAYP AQVFMG DVGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

301 AVSVM LHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIWV 

351 LIGLSTLKIR GNYAVRTPFR RHLNAQ* 



This corresponds to the amino acid sequence <SEQ ID 286; ORF 088>: 

m088 .pep 

1 MFLWLAHFSX WLTGLNXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

101 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPWLVAA GLAIFA YASG HSQFAQYLQL PWAGANE W IFCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

301 AVSVM LQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQ VW RFWIITIVLV 

351 LIGL STLKIR XTYAVXTSFR RHLNAQ* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K^onorrhoeae 

ORF 088 shows 91.7% identity over a 205 aa overlap with a predicted ORF (ORF 088.ng) 
from N, gonorrhoeae: 

m088/g088 

10 20 30 

GWGFLVLSYLTIVGTSNAVNLTDGLDGLA 

IlilllllllMIIIIIIIIIIIIIIIIII 

lAGLALFYLAANSANNILIVPFFKQIALPLGWGFLVLSYLTIVGTSNAVNLTDGLDGLA 
150 160 170 180 190 200 
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40 . 50 60 70 80 90 

TFP WLVAAGLAI FAYASGHSQFAOY LQLP YVAGANE WI FCTAMCGACLGFLWFNAYPA 

:|| lllllllllllhlll Ihlllllllllllllhlllllllllllllllllllll 
AFPFVLVAAGLAIFAYVSGHYQFSQYLQLPYVAGANEVAIFCTAMCGACLGFLWFNAYPA 
210 220 230 240 250 260 

100 110 120 130 140 150 

QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFWEAVSVMLQVGWYKKTKKRIFLM 

llllllllllllllllllllllllllllllllllllllllllllhlllllllllllll 
QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFWEAVSVMLHVGWYKKTKKRIFLT 
270 280 290 300 310 320 

160 170 180 190 200 

APIHHHYEQKGWKETQVWRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 

llllllh : lllllllllllllllhlllllllllll :||l I lllllllll 
APIHHHYQLRCWKETQVWRFWIITIVWLIGLSTLKIRGNYAVRTPFRRHLNAQX 
330 340 350 360 370 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 287>: 

a088.8eq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTCAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGT CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

251 TGTCCACCCT GTTGTGGGGC AACTGGGCAA ACCCGTATAT CTGGATTCTC 

301 TTGGGCGTAT TGCTCGCCAC GGGCGCACTC GGTTTTTACG ACGACTGGCG 

351 CAAAGTCGTC TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAATGG 

401 TGTGGCAGTC AAGCGTTGCC ATTATCGCCG GTTTGGCATT GTTTTACCTT 

451 GCCGCCAATT CCGCCAACAA TATTTTGATT GTCCCGTTCT TCAAACAAAT 

501 CGCCCTGCCG CTGGGCGTGG TCGGCTTTTT GGTGTTGTCT TACCTGACCA 

551 TCGTCGGCAC ATCCAATGCC GTCAACCTCA CCGACGGCTT GGACGGCCTT 

601 GCGACCTTCC CCGTCGTCCT CGTTGCCGCC GGCCTCGCCA TCTTCGCCTA 

651 TGCCAGCGGC CACTCACAAT TTGCCCAATA CCTGCAATTA CCTTACGTTG 

701 CCGGCGCAAA CGAAGTGGTG ATTTTCTGTA CCGCCATGTG CGGCGCGTGC 

751 CTCGGTTTCT TGTGGTTTAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 

801 TGTCGGTGCA TTGGCATTGG GTGCCGCGCT CGGTACCGTC GCCGTCATCG 

851 TCCGCCAAGA GTTTGTCCTC GTCATTATGG GCGGATTATT TGTCGTAGAA 

901 GCCGTATCCG TTATGCTTCA GGTCGGCTGG TATAAGAAAA CCAAAAAACG 

951 CATCTTCCTG ATGGCGCCCA TCCATCACCA CTACGAACAA AAAGGCTGGA 

1001 AAGAAACCCA AGTCGTCGTC CGCTTTTGGA TTATTACCAT CGTCTTGGTG 

1051 TTGATCGGTT TGAGTACCCT CAAAATCCGC TGAACCTATG CCGTCTGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 288; ORF 088.a>: 

aOSB.pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA IIAGLALFYL 

151 AANSANNILI VPFFKQI ALP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPWLVAA GLAIFA YASG HSQFAQYLQL PYVAGANE VV IFCTAMCGAC 

251 LGFLWFNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

301 AVSV MLQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQ VW RFWIITIVLV 

351 LIGL STLKIR *TYAV*TPFR RHLNAQ* 



m088/a088 99.5% identity over a 205 aa overlap 

150 160 170 180 190 200 

m088 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGWGFLVLSYLTIVGTSNAVNLTDGLDgLa 

M I I I I I I I I I I I I I I I I ) I I i I I I I I I I I 
a088 IAGLALFYLAANSANNILIVPFFKQIALPLGWGFLVLSYLTIVGTSNAVNLTDGLDGLA 
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150 160 170 180 190 200 

210 220 230 240 250 260 

m088 . pep TFPWLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEWIFCTAMCGACLGFLWFNAYPA 
I I I I I I I I I t I I I I I I I I I I I I I I i I I I I I I I I I I I I I I M I I I i I I I I I I I I I t I M I I 
a088 TFPWLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEVVIFCTAMCGACLGFLWFNAYPA 

210 220 .230 240 250 260 

270 280 290 300 310 320 

m088 . pep QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFWEAVSVMLQVGWYKKTKKRIFLM 
llllllllilltllllltlllllllllllllllllllllllllllltlllllttillili 
a088 QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFWEAVSVMLQVGWYICKTKKRIFLM 

270 280 290 300 310 320 

330 340 ■ 350 360 370 

m088 . pep APIHHHYEQKGWKETQWVRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 
I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I ) I 
a088 APIHHHYEQKGWKETQWVRFWIITIVLVLIGLSTLKIRXTYAVXTPFRRHLNAQX 
330 340 350 360 370 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 289>: 

g089 . seq 

1 ATGCCGCCCA AAATCACX3AA GAGCGGGTTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATG AATACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACACGGT GCGGCAAGCC TTGGAAGGTT 

151 TTGACGTGTT CCAGCAATGC TTCGCGCGGC AAACCGACGG CCTCGCACAA 

201 AGCCACGGCA GCCATAACGT TGGCGGCGTT GTGCAAACCT TGCAGCGGGA 

251 TGTCTTGCGT AGAAATCAAA TCTTCATTGC CTTGTTTTAA ACAGCCCGTC 

301 CCGCGTTCCA ACCAAAAATC GGCTTCGTGT TCCAAGGAAA ACCGTTTCAC 

351 TTCACGCCCT GCCCGTTTCA TGGCGCGGCA GAACACGTCG TCCGCATTCa" 

401 AAACCTGCAC TCCATCGCCA CGGAAAATCT CGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 290; ORF 089.ng>: 

g089.pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGKPWKV 

51 LTCSSNASRG KPTASHKATA AITLAALCKP CSGMSCVEIK SSLPCFKQPV 

101 PRSNQKSASC SKENRFTSRP ARFMARQNTS SAFKTCTPSP RKISALVCA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 29 1>: 

m089 . seq 

1 ATGCCGCCCA AAATCACkAw GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA AACACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGGAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG CCTCACACAA 

201 AGCCACkGCA GCCATGACGT TAGCGGCGTT GTGCAkACCT TGCAACGGwA 

251 TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG GCGGCCTGTC 

301 TCGCGTTCCA ACCAGAAATC AGCTTCGTGT TCCAAGGAAA ACCATTTTAC 

351 CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG TCCGCATTCA 

401 AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCATAG 

This corresponds to the amino acid sequence <SEQ ID 292; ORF 089>: 

m089 .pep 

1 MPPKITXSGF CKP AIAAAVA PTFVPLLSSI NTTPFFSPIF STRCGRPWKV 

51 LTCSSNASRD KPMASHKATA AMTLAALCXP CNC^SCVTIK SSLPCFRRPV 

101 SRSNQKSASC SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF fi-om A^. gonorrhoeae 

ORF 089 shows 88.6% identity over a 149 aa overlap with a predicted ORF (ORF 089.ng) 
from N. gonorrhoeae: 

ni089/g089 
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10 20 30 40 50 60 

m089 . pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

IIIMI lllllllllllllllllllllhllllllllllllllhlllllMlllill 

g089 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGKPWKVLTCSSNASRG 
10 20 30 40 50 60 

70 80 90 100 110 120 

m089 . pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 

II IIIMIIhllllll Ihlllll lllllllhMI lllllllllhlhlllll 

g089 KPTASHKATAAITLAALCKPCSGMSCVEIKSSLPCFKQPVPRSNQKSASCSKENRFTSRP 
70 80 90 100 110 120 



130 140 150 

m089 .pep ARFIARQNASSAFKTCTPSPRKILALVCAX 

llhlllhllllllMIMMI mill 

g 0 8 9 ARFMARQNTSS AFKTCTPS PRKI SALVCAX 

130 140 150 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 293>: 

a089. seq 

1 ATGCCGCCTA AAATCACGAA GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCA CCGACGTTCG TGCCTTTGCT GTCGTCGATG AACACCACGC 

101 CATTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGAAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGGC AAACCGACGG CTTCGCACAA 

201 GGCAACGGCA GCCATCACGT TAGTGGCGTT GTGCAAGCCT TGCAGCGGAA 

251 TATCTTGCGT GGCAATCAAA TCTTCATTGC CTTGTTTCAG GCGACCTGTC 

301 TCACGTTCCA ACCAAAAATC GGCTTCGTAT TCCAACGAAA ACCATTTCAC 

351 CTCGCGCCCG GCGCGCTTCA TCGCACGACA GAACGCATCG TCCGCATTCA 

401 AAACCTGCAC ACCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 294; ORF 089.a>: 

a089.pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGRP*KV 

51 LTCSSNASRG KPTASHKATA AITLVALCKP CSGISCVAIK SSLPCFRRPV 

101 SRSNQKSASY SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 



in089/a089 91 .9% identity over a 149 aa overlap 

10 20 30 40 50 60 

m089.pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

HUM iiiiiiiiiiiiiiitiiiiiuiiiiiiiiiiiiiiiii iiiiiiiiiii 

a089 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGRPXKVLTCSSNASRG 
10 20 30 40 50 60 



70 80 90 100 110 120 

m089 . pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 
II I I II I I II : I I : I I I il : I : I i I : I I I I I I I I i I II t II I I I I II I I I II I I I I I 
a089 KPTASHKATAAITLVALCKPCSGISCVAIKSSLPCFRRPVSRSNQKSASYSNENHFTSRP 

70 80 90 100 110 120 



130 140 150 

mOe 9 . pep ARFIARQNASSAFKTCTPSPRKILALVCAX 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
a089 ARFIARQNASSAFKTCTPSPRKILALVCAX 

130 140 150 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 295>: 

g090 . seq 

1 ATGCGCGTAG TCGAGCAAAT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 
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51 TGTTCATCAC CGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCcca CACCCACACG TCCGCCTTTT TGCCTTCgcg 

151 ctgCAATtct gcctccaaga cgggcgtacc gatATTGCCC GCAATGAcgg 

201 tatccagccc gcacttgatg CAGAGatagc ggaccaggct ggttaccgTG 

251 GTTttgccgt tgctgCcggt aatcgCaatc accttgtcgC CGCGGCGGtt 

301 cAcaaTGTCc gccaGCAATt ggATGTCGCC TAgCACGCGC .CcgccgTTT 

351 TGCttga 

This corresponds to the amino acid sequence <SEQ ID 296; ORF 090.ng>: 

g090 .pep 

1 MRWEQIWA VEMVFGNVHH RRRSRAQAFG VFQLEAGKLP HPHVRLFAFA 

51 LQFCLQDGRT DIARNDGIQP ALDAEIADQA GYRGFAVAAG NRNHLVAAAV 

101 HNVRQQLDVA XHAXRRFA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 297>: 

m090.seq 

1 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CGCGGCGGTT 

301 CACAATGTCC GCCAGCAATT GGATGTCGCC CAACACGCGT .CCGCCGTTT 

351 TGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 298; ORF 090>: 

m090.pep 

1 MRIVEQVWA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 

51 LPFRLQNRRA DIARDNGIQP TUjDTEIADQA RYRGFAVAAG NRNYLWPAV 

101 HNVRQQFDVA QHAXRRFA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF jfrom N. gonorrhoeae 

ORF 090 shows 83.9% identity over a 118 aa overlap vnih a predicted ORF (ORF 090.ng) 
from K gonorrhoeae: 

m090/g090 

10 20 30 40 50 60 

mO 9 0 . pep MRI VEQVWAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 

Ihllhllllllllllhlllllhlllllllllllll lllllllllll I Ih h 
gO 9 0 MRWEQI WAVEMVFGNVHHRRRSRAQAFGVFQLEAGKLPHPHVRLFAFALQFCLQDGRT 

10 20 30 40 50 60 

70 80 90 100 110 119 

m090 .pep DIARDNGIQPALDTEIADQARYRGFAVAAGNRNYLWPAVHNVRQQFDVAQHAXRRFAX 

lllhHIIIIIhllllll IIIIMIIIIIhll: lllllllhlll llllllll 
g090 DIARNDGIQPALDAEIADQAGYRGFAVAAGNRNHLVAAAVHNVRQQLDVAXHAXRRFAX 
70 80 90 100 110 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 299>: 

a090.seq 

1 ATGCGCGTAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC TGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAACTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCAATTCC GCCTCCAAAA CCGGCGCGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC ACACTTGATG CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT TVATCGCAATC ACCTTGTCGC CGCGGCGGTT 

301 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTT 

351 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 300; ORF 090.a>: 
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MRWEQVVVA VEMVFGNVQH CRRSRAQAFG VFQLETGKLQ HPHVRLFAFA 
LQFRLQNRRA DIARDNGIQP TLDAEIADQA RYRGFAVAAG NRNHLVAAAV 
HNVRQQFDVA QHAXRRFA* 

91.5% identity over a 1 17 aa overlap 

10 20 30 40 50 60 

MRIVEQVVVAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 
I I : I I I I I M I I I I I I I I I I I I I I : I I I I I I I I i : i I I I I I I I I I I I I I I I I I I I I I i 
MRVVEQWVAVEMVFGNVQHCRRSRAQAFGVFQLETGKLQHPHVRLFAFALQFRLQNRRA 

10 20 30 40 50 60 

70 80 90 100 110 119 

DIARDNGIQPALDTEIADQARYRGFAVAAGNRNYLVVPAVHNVRQQFDVAQHAXRRFAX 
I i I I I I I I I I : I I : I I I I t 11 I I t I I I I I I I I I : I I : I i I I I t I I I I t I i I I I I I N I 
DIARDNGIQPTLDAEIADQARYRGFAVAAGNRNHLVAAAVHNVRQQFDVAQHAXRRFAX 

70 80 90 100 110 



The following partial DNA sequence was identified in A^. gonorrhoeae 

g090-l.sdq This sequence contains multiple stop codons (not shown) 

This corresponds to the amino acid sequence < ORF 090-l.ng>: 

g090-l.pep (not shown) 

The following partial DNA sequence was identified in K meningitidis <SEQ BD 2>: 

m090-l.seq 

1 ATGACGGCGT TTGCATTTCA GACGGCATCA CAAAGCCTTA AACGCTTCGA 

51 TAAACACTTC CGAACGGTGC GCGTAGCCTT TGAACATATC AAAGCTCGCG 

101 CAGGCGGGGC TGAGCAACAC AATATCGCCT GCTTCGGCTT GGGCATATGC 

151 CGTCTGAACG GCTTCTCCCA AAGTGGCGCA GTCGGTCATA TTCAAGCCGC 

201 AGCCGTCCAA ATCGCGGCGG ATTTGCGGCG CATCGACACC AATCAAGAAC 

251 ACGCCTTTTG CCTTGCCTAC CAGTGCATCG CGCAGGGGCG TGAAGTCCTG 

301 CCCTTTACCC ATGCCGCCCA AAATCACGAA GAGCGGATTT TGCAAACCGG 

351 CAATCGCGGC GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA 

401 AACACCACGC CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC 

4 51 TTGGAAGGTT TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG 

501 CCTCACACAA AGCCACGGCA GCCATGACGT TAGCGGCGTT GTGCAGACCT 

551 TGCAACGGAA TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG 

601 GCGGCCTGTC TCGCGTTCCA ACCAGAAATC AGCTTCGTGT TCCAACGAAA 

651 ACCATTTTAC CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG 

701 TCCGCATTCA AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT 

7 51 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

801 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

851 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

901 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

951 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 

1001 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT 

1051 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTTT 

1101 GCTTGAACGC CTCAATATCC GGCTGCCGCT CGCTGATGCC GGGACTGAGA 

1151 GCCAGAATAT CGAAACCGTT GTCCAGCGCA TCTTTCAGAC GGCCCGTGTA 

1201 AAACACCAAC CCGTCAAACA TCTTACCGAT TTGCGACACG CGTTCCGGCT 

1251 TCAGCTCCGC ATCATACGCA GCAACCTCCG CGCCGTTTTT GCGCAGGTAG 

1301 GCAATCATGG AAATACCCGT ACCGCCGAGT CCGGCGACGA GGATTTTTTT 

1351 GTTTTGAAAA GTCATTTTGG TTTGTCCTAA 

This corresponds to the amino acid sequence <SEQ ID 3; ORF 090- 1>: 

mOQO-l.pep 

1 MTAFAFQTAS QSLKRFDKHF RTVRVAFEHI KARAGGAEQH NIACFGLGIC 

51 RLNGFSQSGA VGHIQAAAVQ lAADLRRIDT NQEHAFCLAY QCIAQGREVL 

101 PFTHAAQNHE ERILQTGNRG GSRADIRAFA VVDKHHAVFL ADFFHAVRQA 

151 LEGFDVFEQC FARQTDGLTQ SHGSHDVSGV VQTLQRNVLR DN QIFIALFQ 

201 AACLAFQPEI SFVFQRKPFY LAPGTLHRAA ERIVRIQNLH AVATENLGFG 

251 MRIVEQVWA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 

301 LPFRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLWPAV 

351 HNVRQQFDVA QHASAVLLER LNIRLPLADA GTESQNIETV VQRIFQTARV 

401 KHQPVKHLTD LRHAFRLQLR IIRSNLRAVF AQVGNHGNTR TAESGDEDFF 

4 51 VLKSHFGLS* 



a090 .pep 

1 
51 
101 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 303>: 

g091 . Beq 

1 ATGGAAATAC CCXSTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAGTCATTTT GGTTTTGTCC TAAAACAAAT CATATTGGGC AGGAGACGTC 

101 CGCCCTTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCCG ATTAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCT GTTTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTGT TGAGCGAGCG CAAGGTCTTG GCGCACGCCX3 

251 CGTCCATCGT AATACATCAA GCCCAAATTG TATTGGGCTT GGGCATCCCC 

301 TTGTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 304; ORF 091.ng>: 

g091.pep 

1 MEIPVPPSPA TRIFLFESHF GFVLKQIILG RRRPPLPKPL SDGIASRLIT 
51 RLQ ALVIVAA VLVSVLTSLA KPLLSERKVL AHAAS IVIHQ AQIVLGU3IP 
101 LF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 305>: 

m091 . seq 

1 ATGGAAATAC CCGTACCGCC GAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAAGTCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGAGATGTC 

101 CGCCCCTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCC GTCTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAACCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGCTT GGGCTACCCC 

301 CTGCGC... 

This corresponds to the amino acid sequence <SEQ ID 306; ORF 091>: 

m091 .pep 

1 MEIPVPPSPA TRIFLFEKSF WFVLKQIILS RRCPPLPKPL SDGIASCSIT 
51 RLQ ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASF GIHH AQIVLGLGYP 
101 LR. 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 091 shows 84.2% identity over a 101 aa overlap with a predicted ORF (ORF 091. ng) 
from N. gonorrhoeae: 

m091/g091 

10 20 30 40 50 60 

m091.pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 

llllllllllllllllh I lllllllhll lllllllllllll lllllillllll 

g 0 9 1 ME I PVP PS PATRI FLFESHFGFVLKQI I LGRRRPPLPKPLSDGI AS RLI TRLQALVI VAA 

10 20 30 40 50 60 

70 80 90 100 

mO 9 1 . pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQI VLGLGYPLR 

IIIIIIIIIM|:| : llllllh Ihllllllll II 
gO 9 1 VLVSVLTSLAKPLLSERKVLAHAAS I VIHQAQI VLGLGI PLFX 

70 80 90 100 



The following partial DNA sequence was identified in N. meningiditis <SEQ ID 307>: 

a091.seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTTG 

51 GAAATCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGGGATGTC 

101 TGATCCTGCT CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCTTTCAGG CGTTGGTCAT TGTCGCAGCT GTCTTGGTAT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGC 
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This corresponds to the amino acid sequence <SEQ ID 308; ORF 091. a>: 

a091.pep 

1 MEIPVPPSPA TRIFLFWKSF WFVLKQIILS RGCLILLKPL SDGIASCSIT 
51 RFQ ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFGIHH AQIVLG 

m09i/a09i 93.8% identity over a 96 aa overlap 

10 20 30 40 50 60 

in091.pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 
I t i I i M t I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 1 I I I I I I I I : I I I I I I I I 
a091 MEIPVPPSPATRIFLFWKSFWFVLKQIILSRGCLILLKPLSDGIASCSITRFQALVIVAA 

10 20 30 40 50 60 



70 80 90 100 

m091 . pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR 

I I It I I I I I i I I I I I I I I I I I I i I I I I I I I I I I I I I 
a091 VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLG 

70 80 90 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 309>: 

g092 .seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGTGCGC 

51 AAACGGTCAG ACCTTTAAAA TAACGCCTTT ACGCACTAAA AACCAACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTAA GCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGAGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 

301 AACGGTgcgg ATGTCGTCGT TGCCTCTACC GCCGTCAAGA AAGAAaatcC 

351 CGAAGTtgtc gcTGCGTTGG AGCGGCAAAT TCCCGTTATT CCGCGCGCCT 

401 TGATGCTGGC AGAGCTGATG CGCTTCCGTG ACGgcatcgc cattgccggT 

451 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

501 GGCAGGACTC GACCCCACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

551 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

601 GAATCCGATG CCTCTTTCCT ACATCTGACC CCGATTATGT CCGTCGTTAC 

651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 

701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 

751 GCCTTTTTGT GTGTTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 

801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 

851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 

901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 

951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGc gtggcGCTgg 

1001 aagtcGgCGC ATcggttgAA GCGAtcCAAA AaggCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATAcggc gacatCAagt tgccaaacgg 

1101 cggGaccgCT TTgctGGTGG ACGATTAcgg ACACCACCCC GTCGAAATGG 

1151 CGGcaaccct tgccgcTGCA CGCGGCGCGT ATCCGGAAAA acgtTTGGTG 

1201 CtcgCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTACTCAATA CCGTTGatgC GCTGGTACTG ACCGAAGTTT 

1301 AtgccgccgG CGAAGAGCCG GTTGCCGCCG CCGactcCCG CGCCTTGGCG 

1351 CGTGCTATCC GCGTATTGGG CAAACTTGAG CCGATTTACT GCGAAAatgt 

1401 cgccgACCTG CCGCAAATGC TGATGAATGT TTTACAGGAT Ggcgatgttg 

1451 tgttgAATAT GggTgcggga agcatcaacc gcgttccttc cgcgctgttg 

1501 gaattgtcga AACAGAtttg A 

This corresponds to the amino acid sequence <SEQ ID 310; ORF 092.ng>: 

g092 .pep 

1 MFFISIRYIF VRKLWCANGQ TFKITPLRTK NQPERNIMMK NRVSNIHFVG 

51 IGGVa^SGIA EVLHNLGFKV SGSDQARNAA TEHLSSLGIQ VYPGHTAEHV 

101 NGADWVAST AVKKENPEW AALERQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 
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251 AFLCVDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLN ALAAIG VALEVGASVE AI QKGLLGFE 

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEr4AATLAAA RGAYPEKRLV 

401 liAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP VAAADSRALA 

451 RAIRVLGKLE PIYCENVADL PQMLMNVLQD GDWLNMGAG SINRVPSALL 

501 ELSKQI* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 31 1>: 

m092 . seq 

1 ATGTTTTTTA TTTCT^TCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTTA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGCATCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAgGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC CGAACACGTT 

301 AACGGTGCGG ATGTCGTCGT TACCTCTACC GCCGTCAAAA AAGAAAATCC 

351 CGAAGTTGTC GCTGCGTTGG AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 

401 TGATGTTGGC GGAGTTGATG CGCTTCCGTG ACGGCATCGC CATTGCCGGC 

451 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

501 GGCAGGACTT GACCCGACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

551 GCACTAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CX5AAGCCGAC 

601 GAGTCGGATG CATCCTTTCT GCACCTGACA CCGATTATGT CCGTCGTTAC 

651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 

701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 

751 GCCTTTTTGT GTATTQACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 

801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 

851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 

901 CAAATGAAAG GACATGAGCA GGGGTCGTTT 6AAGTCGTGC TGAATATGCC 

951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGC GTGGCGCTGG 

1001 AAGTCGGCGC ATCGGTTGAA GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 

1051. GGCGTCGGCC GCCGCTTCCA AAAATACGGC GACATCAAGT TGCCAAACGG 

1101 CGGGACCGCG CTCTTGGTGG ACGACTACGG ACACCACCCC GTCGAAATGG 

1151 CGGCGACCCT TGCCGCCGCA CGCGGCGCGT ATCTGGAAAA ACGTTTGGTA 

1201 CTCGCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTCCTCAATA CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 

1301 ATGCCGCCGG TGAAGAGCCG ATTGCCGCCG CCGATTCCCG CGCTCTTGCC 

13 51 CGCGCCATCC GCGTGTTGGG CAAACTCGAG CCGATTTACT GCGAAAACGT 
1401 TGCCGATCTG CCCGAAATGC TGTTGAACGT TTTGCAGGAC GGCGACATCG 

14 51 TGTTGAATAT GGGCGCGGGA AGCATCAACC GCGTCCCCGC CGCGCTGCTG 
1501 GCATTGTCGA AACAGATTTG A 

This corresponds to the amino acid sequence <SEQ ID 312; ORF 092>: 

m092 .pep 

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVWTST AVKKENPEW AALEQQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLN ALAAIG VALEVGASVE AI QKGLLGFE 

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYLEKRLV 

401 liAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP lAAADSRALA 

451 RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL 

501 ALSKQI* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF fi"om N. gonorrhoeae 

ORF 092 shows 96.6% identity over a 506 aa overlap with a predicted ORF (ORF 092.ng) 
from N. gonorrhoeae: 

Tn092/g092 

10 20 30 40 50 60 

m092 .pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 
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g092 



Mlllllllllllll Mil lllllll :| llllllllllhllllllllllllllll 

MFFISIRYIFVRKLWCANGQTFKITPliRTKNQPERNIMMKNRVSNIHFVGIGGVGMSGIA 
10 20 30 40 50 €0 



70 BO 90 100 110 120 

m092 . pep EVLHNI/3FICVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVWTSTAVKKENPEW 

I I I I I I I I I I I I M I I II II II I I : II Ml II II II I II I I II I I I h II II I I II I I I I 
g092 EVLHNIiGFKVSGSDQARNAATEHLSSLGIQVYPGHTAEHVNGADVVVASTAVKKENPEVV 
70 80 90 100 110 120 

130 140 150 160 170 180 

m092 . pep AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

lllhlllllllllllllllllllllllllllllllllllllllllllllllllllllll 
g 0 9 2 AALERQ I PVI PRALMLAELMRFRDGI AI AGTHGKTTTTSLTAS I LGAAGIiDPTFVI GGKL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m092 . pep NAAGTNARIiGKGEYIVAEADESDASFLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
g092 NAAGTNARIiGKGEYIVAEADESDASFLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 
190 200 210 220 230 240 

250 260 270 280 290 300 

m092 . pep FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIEIWGAQMKFTVHV 

IIIIMIIIIIIIhlllllllllllllllllllllllllllllllllllllllllllll 

g092 FIHRMPFYGKAFLCOTSEHVRAILPK^SKPYATYGLDDTADIYATDIENVGAQMKFTVHV 
250 260 270 280 290 300 

310 320 330 340 350 360 

mO 92 . pep QMKGHEQGSFEWLNMPGRHNVLNAIAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 
g092 QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 
310 320 330 340 350 360 

370 380 390 400 410 420 

m092 .pep DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 

llllllllllllllllllllllllllllllllll IIIIIIIIIIIIIIIIIIIIIMII 

g092 DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 
370 380 390 400 410 420 



430 440 450 460 470 480 

m092 . pep VLNTVDALVLTEVYAAGEEPIAAADSRAIARAIRVLGKLEPIYCEOTADLPEMLLN^ 

llll'lllllllllllllllhlllllllllllllllllllilllllllllhlhlllll 
g092 VLNTVDALVLTEVYAAGEEPVAAADSRALARAIRVLGKLEPIYCENVADLPQMLMOTL^ 
430 440 450 460 470 480 



490 500 
m092 . pep GDIVLNMGAGSINRVPAALLALSKQIX 

ihiiiiiiiiiiiihiii mill 

g092 GDWLNMGAGSINRVPSALLELSKQIX 
490 500 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3 13>: 

a092 . seq 

1 ATGTTTTTTA TTTCMTCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTGA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 TTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 
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301 AACGGTGCGG ATGTCGTCGT TACCTCTACC GCCGTCAAAA AAGAAAATCC 

351 CGAAGTTGTC GCTGCGTTGG AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 

401 TGATGTTGGC GGAGTTGATG CGCTTCCGTG ACGGCATCGC CATTGCCGGC 

451 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

501 GGCAGGACTT GACCCGACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

551 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

601 GAGTCGGATG CATCCTTTCT GCACCTGACA CCGATTATGT CCGTCGTTAC 

651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGT GTTGAGAAGC 

701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 

751 GCCTTTTTGT GTATTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 

801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 

851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 

901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 

951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGC GTGGCGCTGG 

1001 AAGTCGGCGC ATCGGTTGAA GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 

1051 GGTGTCGGCC GCCGCTTCCA AAAATACGGC GACATCAAGT TGCCAAACGG 

1101 TGGAACCGCG CTCTTGGTGG ACGACTACGG ACACCACCCC GTCGAAATGG 

1151 CGGCGACCCT TTCCGCCGCA CGCGGCGCGT ATCCGGAAAA ACGTTTGGTA 

1201 CTCGCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTCCTCAATA CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 

1301 ATGCCGCCGG TGAAGAGCCG ATTGCCGCCG CTGATTCCCG CGCTCTTGCC 

1351 CGCGCCATCC GCGTGTTGGG CAAACTCGAG CCGATTTACT GCGAAAACGT 

14 01 TGCCGATCTG CCCGAAATGC TGTTGAACGT TTTGCAGGAC GGCGACATCG 

14 51 TGTTGAATAT GGGTGCGGGA AGCATCAACC GCGTCCCCGC CGCGCTGCTG 

1501 GAATTGTCGA AACAGATTTG A 



This corresponds to the amino acid sequence <SEQ ID 314; ORF 092. a>: 

a092 .pep 

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVWTST AVKKENPEW AALEQQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVTVEAD 

201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLNA LAAIG VALEVGASVE AI QKGLLGFE 

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLSAA RGAYPEKRLV 

401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP lAAADSRALA 

451 RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL 

501 ELSKQI* 



m092/a092 99.4% identity over a 506 aa overlap 



10 20 30 40 50 60 

m092.pep MFFISIRYIFVRKLWRAKGQPFKITPLRIENPPERNIMMKNRVTNIHFVG IGGVGMSGIA 

I I I I I t I I I I I I I I I I I I I I I I I I I I I I I i I I 1 I t I I I I I I I i I I I I I I I I I I I I t I I I I 
a092 MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 

10 20 30 40 50 60 



70 80 90 100 110 120 

mO 92 . pep EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADWVTSTAVKKENPEW 
I 1 I I I t I I i I I I I I I I I I I I I I I I M I I I i I I t 1 M I I I I I M I I I I I I I I I I I I I I I I I 
a092 EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADWVTSTAVKKENPEW 

70 80 90 100 110 120 



130 140 150 160 170 180 

m092 . pep AALEQQIPVI PRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

I I I I I I I I I I I I I I M I i I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
a092 AALEQQIPVI PRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m092 . pep NAAGTNARLGKGEYIVAEADES DAS FLHLTPIMSWTN I DEDHMDTYGHS VEKLHQAFID 

I M M I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I i i I I I I M I i I I I I t I I I I I I I j I 
a0 92 NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 
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190 200 210 220 230 240 

250 260 270 280 290 300 

m092 . pep FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV 
I I I I I I I I I I I I t I I I I 1 t I I I t I I I I I I I 1 I I I I I I I I i I I I i t I M i I I i I I j I I j I I 
a092 FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV 

250 260 270 280 290 300 

310 320 330 340 350 360 

m092 . pep QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 
I i I I I I I I I I i I I I 1 I I I I I I I I ) I I I I I ) I I I I I I I I I I I I M I I I I I I I I I I t I I I I I 
a092 QMKGHEQGSFEWLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 

370 380 390 400 410 420 

m092 . pep DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 
I I I I I I I I j I I ) I I I I 1 I I I I I I i I I I : I i I I I I I I t I I I I I I I I t I I I I I 1 I I I I I I I 
a092 DIKLPNGGTALLVDDYGHHPVEMAATLSAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 

370 380 390 400 410 420 

430 440 450 460 470 480 

m092 . pep VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPI YCENVADLPEMLLNVLQD 

I I I I I I I I I ) t I I I i I M I I I I I [ I I I I I I I I I I I I I i I I I I I I I I I i I I I M I I I I I I I 

a092 VLNTVDALVLTEVYAAGEEPITVAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD 
430 440 450 460 470 480 

490 500 
m092 .pep GDIVLNMGAGSINRVPAALLALSKQIX 
I I I I I I I I I t I I i i I I I I I i I I I I I I 
a092 GDIVLNMGAGSINRVPAALLELSKQIX 

490 500 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 315>: 

g093 . seg 

1 aTGCAGAATt ttgGCAAAGT ggccgtATTG ATGGGtggtT TTTCCAGCGA 

51 ACGAGAaatC tcgcTGGACA GCgGTACCGC CATTTTGAAC GCCTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGACC CTAAGGAAAC GCCGTTATCC 

151 GAACTGAAGG AGCGGGGCTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCCTCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTACCCGTTC CCGAGTTCGC 

351 CGTACTGTAC GATGATACCG ATTTCGATGC CGTCGAAGAA AAATTGGGTC 

401 TGCCGATGTT TGTGAAGCCG GCGGCCGAAG GCAGCAGCgt cggcgtggta 

451 aAAGTCAAAG AAAaaggccg TCTGAAAAGC GTTtacgaag aatTGAaaCA 

501 CCTTcagggg cgaAAtcatt gccgAacgTT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATCCC 

601 CGCAACCGAG TTTTACGAct acgaagccaa GtacaaCCGA GACGAcacca 

651 tttaTCAATG TCCTTCGGAA GATTTGACCG AAGCCGAAGA AAGCCTGATG 

701 CGCGAACTGG CGGTTCGCGG CGCACAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 316; ORF 093,ng>: 

g093 .pep 

1 MQNFGKVAVL mggfsserei sldsgtailn alkskgiday afdpketpls 

51 elkergfqta fnilhgtyge dgavqgalel lgipytgsgv aasaigmdky 

101 rckliwqalg lpvpefavly ddtdfdavee klglpmfvkp aaegssvgw 

151 kvkekgrlks vyeelkhlqg rnhcrtfyrr rriflprper qraarhthhp 

201 rnrvlrlrsq vqprrhhlsm sfgrfdrsrr kpdartggsr rtgnrcgrlr 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 317>: 

m093 . seq 
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1 


ATGCAGAATT 


51 


ACGAGAAATC 


101 


GCAAAGGCAT 


151 


GAATTGAAGG 


201 


TTACGGCrAA 


251 


CCTATACCGG 


301 


CGCTGCAAAC 


351 


CGTCCTGCAC 


401 


TGCCGATGTT 


451 


AAAGTCAAAG 


501 


CCTTCAGGG. 


551 


TCCTGCCCCG 


601 


CGCAACCGAG 


651 


TTTATCAATG 


701 


CGCGAACTGG 


751 


GCGCGTCGAT 


801 


TCAACACCCT 



CGAAATCATT GCCGAACGTT TTATCGGCGG CGGCGAATAT 



This corresponds to the amino acid sequence <SEQ ID 318; ORF 093>: 
m093 .pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGX DGAVQGALEL LGIPYTGSGV T^AIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKGKGRLKS VYEELKHLQX RNHCRTFYRR RRIFLPRPER QRAARHTHHS 

201 RNRVIiRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDEP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 093 shows 96.7% identity over a 276 aa overlap with a predicted ORF (ORF 093 .ng) 
from A^. gonorrhoeae: 

m093/g093 

10 20 30 40 50 60 

m093 . pep MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 

llllllllillllllllllllllllllllllllllllllllllllllllllll :||||| 
g093 MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKERGFQTA 
10 20 30 40 50 60 

70 80 90 100 110 120 

mO 9 3 . pep FNI LHGT YGXDGAVQGTUjELLGI P YTGS GVAAS AIGMDKYRCKLI WQAI/5LPVPEFAVLH 

lllllllll lllllllllllllllllllllllllllllllllllllllllllllilll: 
g093 FNIIiHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQAIxGLPVPEFAVLY 

70 80 90 100 110 120 

130 140 150 160 170 ISO 

mO 9 3 . pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 

I II II 1 1 III II I II II Mill II 1 11 III II I lllllllllllllll llllllllll 

g093 DDTDFDAVEEKIiGLPMFVKPAAEGSSVGWKVKEKGRLKSVYEELKHLQGRNHCRTFYRR 
130 140 150 160 170 180 

190 200 210 220 230 240 

m093 . pep RRI FLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 

Illllllllllllllllll IIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIMIIIII 

g093 RRIFLPRPERQRAARHTHHPRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 
190 200 210 220 230 240 



250 260 270 - 

mO 9 3 . pep RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX 

hMlllllllllllllllllllllllilllllhll 

g093 RTGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX 
250 260 270 
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The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 319>: 

a093.seq 

1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CCAAGGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

401 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 

4 51 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 

501 CTTTCAGGG. CGAAATCATT GCCGAACGGT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCTG TGTTGAACGG CT^GGCCTG CCCGGCATAC ACATCATCCC 

601 CGCGACCGAG TTTTATGACT ACGAAGCCAA GTACAACCGC AACGACACCA 

651 TTTATCAATG TCCTTCGGAA GATCTGACCG AAGCCGAAGA AAGCCTGATG 

701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 320; ORF 093. a>; 

a093.pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKGKGRLKS VYEELKHFQX RNHCRTVYRR RRIFLPCVER QRPARHTHHP 

201 RDRVL*LRSQ VQPQRHHLSM SFGRSDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 

m093/a093 95.7% identity over a 276 aa overlap 

10 20 30 40 50 60 

ni093.pep MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 
I I I I I I I I I I I I I I I I I I I i I I I I i I M I I I I I M I I i I I I 1 I I i I I I I I I I I I I I i i I I 
a093 MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m093.pep FN I LHGTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKL I WQALG LPVPEFAVLH 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I 
a093 FNILHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m093 . pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 
I I I I I I I I I I I I I I I M I I I I I i I It I I I ) I I I I 1 I I I I M I I I I I I : t I I I i I I I III 
a093 DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHFQXRNHCRTVYRR 

130 140 150 160 170 180 

190 200 210 220 230 240 

m093.pep RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 

Mini nil linn inn 1 1 1 1 1 ii n ii i n i ii i iniiiiiniiiii 

a093 RRIFLPCVERQRPARHTHHPRDRVLXLRSQVQPQRHHLSMSFGRSDRSRRKPDARTGGSR 
190 200 210 220 230 240 

250 260 270 

m093 . pep RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX 

I n 1 1 1 1 1 II I II 1 1 II 1 1 n 1 1 n 1 1 1 i II II I n I 

a093 RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX 
250 260 270 
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The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 32 1>: 

g094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTGG GGTCGAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTAccggc aatggcgatg 

151 CCGTCACGGA AGCGCATCAG CTCTGCCAGC ATCAAGGCGC GCGGAATAAC 

201 GGGAATTTGC CGCTCCAACG CAgcgacaAC TTCGGgattT TCTTTCTTGA 

251 CGGCGGTAGA GGCAACGACG ACATccgcAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 322; ORF 094.ng>: 

g094 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMAM 
51 PSRKRISSAS IKARGITGIC * RSNAATTSGF SFLTAVEATT TSAPLTCSAV 
101 WPG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 323>: 

m094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTA GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCGGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 324; ORF 094>: 

tn094 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMAM 
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 094 shows 95.1% identity over a 103 aa overlap with a predicted ORF (ORF 094.ng) 
from A^. gonorrhoeae: 

m094/g094 



10 20 30 40 50 60 

m094 .pep MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAMPSRKRINSAN 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIhlh 

g 0 9 4 MYS PLPKRALVPAALS LP P I TKVGS S PAAPRMEAVRLWWLPCVPAMAMPSRKRI SS AS 

10 20 30 40 50 60 



70 80 90 100 

m094 .pep IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 

IHIIIIIII lllllllllllllil|:|||||||||||||||| 
g094 IKARGITGICRSNAATTSGFSFLTAVEATTTSAPLTCSAVWPGX 
70 80 90 100 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 325>: 

a094 .seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 



This corresponds to the amino acid 



sequence <SEQ ID 326; ORF 094.a>: 
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a094 .pep 

1 
51 
101 



MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMA M 
PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
WPG* 



in094/a094 1 00.0% identity over a 1 03 aa overlap 

10 20 30 40 50 60 

m094.pep MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAMPSRKRINSAN 
I I t I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I i I I I I I I I I I I I i I I 
a094 MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAMPSRKRINSAN 

10 20 30 40 50 60 



70 80 90 100 

m094 . pep IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 
I t I I I I I I I I I I i I I I I I f I I I I f I i I I I i I II I I I I I I I I I I I 
a094 IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 

70 80 90 100 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 327>: 

g095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AACACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGGGTCA GTGTAGGAAA 

3 01 GAGGCATCGG ATCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTGG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 328; ORF 095.ng>: 

g095 .pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDK RL MQFFDAVPVG IHMVFVD ICaJ DGHNRGQCRK 

101 EASDRRLRQR CIRLCPSGRW CLRR* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 329>: 

Tn095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

301 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTAG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 330; ORF 095>: 

m095 .pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDK RL MQFFDAVPVG IHMVFVD IGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRX CLRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF fi-om A^. gonorrhoeae 

ORF 095 shows 97.6% identity over a 124 aa overlap with a predicted ORF (ORF 095.ng) 
fi"om A^. gonorrhoeae: 

m095/g095 



10 20 30 40 50 60 

m095 . pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

IIIIIMIIIIIIIIIIIIMIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIMIIIII 
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g 0 9 5 MS FHLNMDGEFHIiRADVFDVGGVDVGGI VQTVSS I RFAHFGQNRADVFAVNTQKGFAVEG 

10 20 30 40 50 60 

70 80 90 100 110 120 

mO 9 5 . pep HTVDE IDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

lllllllllllllllllllllllllllllllllll lllhlllllllllllllillll 
g095 HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRGQCRKEASDRRLRQRCIRLCPSGRW 
70 80 90 100 110 120 



m095.pep CLRRX 

nil! 

g095 CLRRX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 331>: 



a095. seq 








1 


ATGTCCTTTC 


ATTTGAACAT 


GGACGGTGAA 


51 


TTTCGATGTC 


GGTGGCGTAG 


ATGTCGGCGG 


101 


GCATAAGGTT 


TGCTCACTTT 


GGGCAAAATC 


151 


AATACACAAA 


AAGGCTTTGC 


CGTAGAAGGG 


201 


TAAACGCCTG 


ATGCAGCTTC 


TCAACACTGT 


251 


TCTTCGTCGA 


TATTGGTAAC 


GACGGACATA 


301 


GATGCATCCG 


ACCGTCGGCT 


TCGGCAACGA 


351 


CGGGCGTTGG 


TGCCTGCGGC 


GTTGA 



This corresponds to the amino acid sequence <SEQ ID 332; ORF 095.a>: 

a095.pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDKRL MQLLNTVPVG IHMVFVDIGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRW CLRR* 

m095/a095 96,0% identity in 124 aa overlap 

10 20 30 40 50 60 

m095 . pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 
I I I I I I I I I I I I I I I I I i I I I I i I I I I I I I I M I I I i I I I I I I I I I I I I I I I I I I I I M I 
a095 MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

10 20 30 40 50 60 

70 80 90 100 110 120 

in095.pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

I I I M I I I I I I I :::: I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I 
a095 HTVDEIDKRLMQLLNTVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRW 

70 80 90 100 110 120 

m095.pep CLRRX 
Mill 

a095 CLRRX 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 

333>: 

g096 . seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGTGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAAATCT TCCGMGGAC ATTGAtaaat ggtgTCGTCT CGGttgtaCt 
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201 tggcttcgta gTCGTAAAAC TCGGTTGCGG GGATGATGTG TATGCCGGGC 

251 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 AcgtTcggca atgaTTtcgc ccctgAAGGT GttTCAattc ttcgtaAACG 

351 CTTTTCAGAc ggccttTTTC TTTGA 



This corresponds to the amino acid sequence <SEQ ID 334; ORF 096.ng>: 

g096 .pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLCAANR QFAHQAFFGF 

51 GQIFRRT LIN GVVSWLGFV WKL GCGDDV YAGQPFAVQD GAGIFAAADK 

101 TFGNDFAPEG VSILRKRFSD GLFL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 335>: 



m096.8eq 

1 ATGGCTCGTC ATACCGGGCA 

51 CGTCGGTATC TTTGAGGAAA 

101 GCCTGCGCGC CGCGAACCGC 

151 GGTCAAATCT TCCGAAGGAC 

201 TGGCTTCGTA GTCGTAAAAC 

251 AGCCCTTTGC CGTTCAGGAC 

301 ACGTTCGGCA ATGATTTCGC 

351 CTTTTCAGAC GGCCTTTTCC 



GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 
TCGACGCGCA CGCAGCCTTC CGCACCGATT 
CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 
ATTGATAAAT GGTGTCGTCG CGGTTGTACT 
TCGGTTGCGG GAATGATGTG TATGCCGGGC 
GGGGCAGGAA TATTCGCCGC CGCCGATAAA 
CCTGAAGGT GTTTCAATTC TTCGTAAACG 
TTTGA 



This corresponds to the amino acid sequence <SEQ ID 336; ORF 096>: 

m096.pep 

1 MARHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF 

51 GQIFRRT LIN GVVAWLGFV WKL GCGNDV YAGQPFAVQD GAGIFAAADK 

101 TFGNDFAXEG VSILRKRFSD GLFL* 

itt096/g096 96,0% identity in 124 aa overlap 



10 20 30 40 50 60 

m096 . pep MARHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFGFGQIFRRTLIN 
II II II II II t II M I II II II II II t I I II I II I I I I I I I I I t I I I I I I I I I I I I I I 
g096 MAGHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLCAANRQFAHQAFFGFGQIFRRTLIN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m096 . pep GVVAWLGFVWKLGCGNDVYAGQPFAVQDGAGIFAAADKTFGNDFAXEGVSILRKRFSD 
I I I : I II I I I II I I II I : I M II I I I I I I I I M I I I I t II I I I ) I I I I I I I I I I II I II 
g096 GWSWLGFVWKLGCGDDVYAGQPFAVQDGAGIFAAADKTFGNDFAPEGVSILRKRFSD 

70 80 90 100 110 120 



m096.pep GLFLX 
I I I I I 

g096 GLFLX 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 337>: 
a096.seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGCGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAGATCT TCCGAAGGAC ATTGATAAAT GGTGTCGTTG CGGTTGTACT 

201 TGGCTTCGTA GTCATAAAAC TCGGTCGCGG GGATGATGTG TATGCCGGGC 

251 AGCCCTTTGC CGTTCAACAC AGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 CCGTTCGGCA ATGATTTCGC CCT.GAAAGT GTTTCAATTC TTCGTAAACG 

351 CTTTTCAGAC GGCCTTTTCC TTTGA 

This corresponds to the amino acid sequence <SEQ ID 338; ORF 096.ng>: 
a096,pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRA7VNR QFAHQAFFGF 



wo 99/57280 



PCT/US99/09346 



299 



51 GQIFRRT LIN GWAVVLGFV VIKL GRGDDV YAGQAFAVQH RAGIFAAADK 
101 PFGNDFAXES VSILRKRFSD GLFL* 



m096/a096 92.7% identity in 124 aa overlap 

10 20 30 40 50 60 

m096.pep MARHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFGFGQIFRRTLIN 
II M I I I M I I I I I II II I I I I I I I I I II II I I I I I I I I I I II I M It II I 1 I I I I I I I 
a096 MAGHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFGFGQIFRRTLIN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m096 . pep GWAVVLGFWVKLGCGNDVYAGQPFAVQDGAGIFAAADKTFGNDFAXEGVSILRKRFSD 
11111111111:111 1:111111 III! II I M I II I I II I I I M : I I I I I I I I I I 
a096 GWAVVLGFWIKLGRGDDVYAGQAFAVQHRAGIFAAADKPFGNDFAXESVSILRKRFSD 

70 80 90 100 110 120 



m096.pep GLFLX 
I I II I 

a096 GLFLX 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 339>: 

g097 . seq 

1 ATGGATATTT CAAAACAAAC ATTGCTGGAT AGGGTTTTTA ACCTGAAGGC 

51 AAACGGTACG ACGGTACGTA CCGAGTTGAT GGCGGGTTTG ACGACCTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAATC CCCTGATTTT GGGCGAGACC 

151 GGAATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CATCCGCCAT 

201 CGGCTGTTTT GTCATGGGTT TTATCGGCAA CTATCCGATT GCGCTTGCCC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTGGC GTTGGGTGCG GTGTTCATTT CCGGTCTGAT 

351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCCG CCGGTATCGG TTTGTTTTTG 

451 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 

501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC CAGCGCACTG TTGGCATTGT 

551 TCGGTTTTGT CATGGTGGTC GTATTGGGGT ATTTCCGCGT TCAAGGCGCA 

601 ATCATCATCA CCATTCTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 

651 GAACGAGTTT CACGGCGTGG TCGGCGAAGT ACCGGGCATT GCGCCGACCT 

701 TTATGCAGAT GGATTTTAAA GGTCTGTTTA CCGTCAGCAT GGTCAGCGTG 

751 ATTTTCGTCT TCTTCTTGGT CGATTTGTTC GACAGTACCG GAACGCTGGT 

801 CGGCGTATCC CACCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 

851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 

901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 

951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TGGCGTGTCT GATGTTCTCC CCATTGGCGA AAAGTGTTCC GGTATTTGCC 

1051 ACCGCGCCCG CACTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATTGAT TGGGACGATA TGACTGAAGC CGCGCCCGCG TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC 

1201 TTCATCAGCT ATGCCGTGGT CAAACTTTTG TGTCGCCGGA CTGGGGACGT 

1251 GCCGCCTATG GTATGGGTTG TTGCCGTATT GTGGGCATTG AAATTCTGGT 

1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 340; ORF 097.ng>: 

g097 .pep 

1 MDISKQTLLD RVFNLKANGT TVRTE LMAGL TTFLTMCYIV IVN PLILGET 

51 GMDM GAVFVA TCIASAIGCF VM GFIGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPW QVALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAG IGLFL 

151 ALISLKGAGI IV ANPATLVG LGDIHQPS AL LALFGFVMW VLGYF RVQGA 

201 IIITILTITV lASLMGL NEF HGWGEVPGI APTFMQMDFK GLFTVSMVSV 

251 IFVFFLVDLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPVFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 
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401 FISYAWKLL CRRTGDVPPM VWWAVLWAL KFWYLG * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 341>: 

m097.seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACk ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCyGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

3 01 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

351 TTTTATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

401 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 

451 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 

501 CTTGGTCGGT TTGGGCX3ATA .TTCATCAGCC GTCCGCGTTG TTGGCATTGT 

551 TCGGTTTTGC TATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 

601 ATCATCATCA CCATCTTGAC CATTACCGTC ATTGCCAGCC TGATGGGTTT 

651 GAATGAATTT CACGGCATCA TCGGCGAAGT ACCGAGCATT GCGCCGACTT 

701 TTATGCAGAT GGATTTTGAA GGCCTGTTTA CCGTCAGCAT GGTCAGTGTG 

751 ATTTTCGTCT TCTTCTTGGT CGATCTATTT GACAGTACCG GAACGCTGGT 

801 CGGCATATCC CACCGTGCCG GGCTGCTGGT GGACGGTAAG CTGCCCCGCC 

851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 

901 TTGGGTACTT CTTCCACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 

951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGATATTGAT TGGGACGATA TGACGGAAGC CGCACCTGCG TTCCTGACCA 

1151 TTGTTTTCAT GCCGTTTACT TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

1251 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 

1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 342; ORF 097>: 

m097 .pep 

1 MDTSKQTLLD GIFKLKANGT TVRTE LMAGL TTFLTMCYIV IVN PXILGET 

51 GMDM GAVFVA TCIASAIGCF VM GFVGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPW QVALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAG IGLFL 

151 ALISLKGAGI IVA NPATLVG LGDIHQPS AL LALFGFAMW VLGHFR VQGA 

201 IIITILTITV lASLMGLN EF HGIIGEVPSI APTFMQMDFE GLFTVSMVSV 

251 IFVFFLVD LF DSTGTLVGIS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

401 FISYAWKLL CRRTKDVPPM VWIVAVLWAL KFWYLG * 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N, gonorrhoeae 

ORF 097 shows 96.3% identity over a 436 aa overlap with a predicted ORF (ORF 097.ng) 
from N. gonorrhoeae: 

m097/g097 

m097 .pep 
g097 



m097 .pep 
g097 

70 80 90 100 110 120 



10 20 30 40 50 60 

MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPXILGETGMDMGAVFVA 

II lllllll :|:|IIIIIIIIIIIIIIIIIIIMIIIIIIII lllllllllllllll 
MDISKQTLLDRVFNLKANGTTVRTELMAGLTTFLTMCYIVIVNPLILGETGMDMGAVFVA 
10 20 30 40 50 60 

70 80 90 100 110 120 

TCIASAIGCFVMGFVGNYPIALAPGMGLNAYFTFAWKC»1GVPWQVALGAVFISGLIFIL 

IIMIIIIIIIIIhllllllllllllllllllllllllMIIIIIIIIIIIIIIIIIII 

TC I ASAI GCFVMGF I GN YP I ALAPGMGLNAYFTFAWKGMGVPWQVALGAVF I SGL I F I L 



130 140 150 160 170 180 
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m097 . pep FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 

IIIIIIIIIIIIIIIMIIIIIIIillllllllllllllllllllllllllMIIIIIII 

g097 FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 
130 140 150 160 170 180 



190 200 210 220 230 240 

m097.pep IALFGFAMVWIX3HFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFE 

lllllhlllllhlllllllllllllllllllllllllllhHIIhllllllllll: 
g097 I^FGFVMVVVLGYFRVQGAXIITILTITVIASLMGLl^FHGWGEVPGIAPTFMQm 

190 200 210 220 230 240 



250 260 270 280 290 300 

mO 9 7 . pep GLFTVSMVSVI FVFFLVDLFDSTGTLVGISHRAGLLVDGKLPRLKRALLADSTAIVAGAA 

lllllllllllllllllllllllllllhllllllllllllMIIIIIIMIIIIIilll 

go 9 7 GLFTVSMVSVI FVFFLVDLjPDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAI VAGAA 

250 260 270 280 290 300 



310 320 330 340 350 360 

m097 . pep LGTSSTTP YVES AAGVSAGGRTGLTAVTVGVLMLACLMFS PLAKSVPAFATAP ALL YVGT 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIhllllllllllll 

g097 LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPVFATAPALLYVGT 
310 320 330 340 350 360 



370 380 390 400 410 420 

m097 . pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 

llllllllllllllllllllllllllllllllllilllllllllllllllllll Mill 

g097 QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTGDVPPM 
370 380 390 400 410 420 



430 

mO 9 7 . pep VWI VAVLWALKFW YLGX 

I I : I I t I I I I I I I i I I I 
g097 VWWAVLWALKFWYLGX 
430 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 343> 
aO 97 . seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACG ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCTGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

401 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 

451 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 

501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCACTGT 

551 TCGGTTTTGC CATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 

601 ATCATCATCA CCATTTTGAG GATTACCGTC ATTGCCAGCC TGATGGGTTT 

651 GAACGAATTT CACGGCATCA TCGGCGAAGT GCCGAGCATT GCGCCGACTT 

701 TTATGCAGAT GGATTTTAAA GGGTTGTTTA CCGTCAGCAT GGTCAGCGTG 

751 ATTTTCGTCT TTTTCCTAGT CGATCTGTTC GACAGTACCG GAACACTGGT 

801 CGGTGTATCG CATCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 

851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CTATTGTGGC AGGTGCGGCT 

901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGTGCGG CGGGCGTATC 

951 GGCAGGCGGG CGGACAGGTC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATCGAT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

1251 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 
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1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 344; ORF 097.a>: 
a097 .pep 

1 MDTSKQTLLD GIFKLKANGT TVRTE LMAGL TTFLTMCYIV IV NPLILGET 

51 GMDMG AVFVA TCIASAIGCF VM GFVGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAGIGLFL 

151 ALISLKGAGI IVA NPATLVG LGDIHQPS AL LALFGFAMVV VLGHF RVQGA 

201 IIITILTITV lASLMGL NEF HGIIGEVPSI APTFMQMDFK GLFTVSMVSV 

251 IFVFFLV DLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

401 FISYAWKLL CRRTKDVPP M VWIVAVLWAL KFWYLG * 

m097/a097 99.3% identity in 436 aa overlap 

10 20 30 40 50 60 

m097 . pep MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPXILGETGMDMGAVFVA 
I I I I I I I I i I I I I I I t M I I I I 1 I I I I I I I I t I I I I t I M I I I I I I I M I I i I I I I I I I 
a097 MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPLILGETGMDMGAVFVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m097 . pep TCIASAIGCFVMGFVGNYPIALAPGMGLNAYFTFAWKGMGVPWQVALGAVFISGLIFIL 
I I t I I i I I i I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I i I I I I I I 
a097 TCIASAIGCFVMGFVGNYPIALAPGMGLNAYFTFAWKGMGVPWQVALGAVFISGLIFIL 

70 80 90 100 110 . 120 

130 140 150 160 170 180 

m097 . pep FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 
I I I 1 I I I I i I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I t I I I I I I I I I i I I I I I I i I 
a097 FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m097 .pep LALFGFAMVVVLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFE 
I I I I I i I I I I I I I I I I I I I I I I i I I I I I I i I M I 1 I i I I I I I I i I I I I It I I I I I I I I I : 
a097 LALFGFAMVWLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m097 . pep GLFTVSMVSVIFVFFLVDLFDSTGTLVGISHRAGLLVDGKLPRLKRALLADSTAIVAGAA 
I I i I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I i i I I I I I I I I I I I I I I 1 I I I I I I I M 
a097 GLFTVSMVSVIFVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAIVAGAA 

250 260 270 280 290 300 

310 320 330 340 350 360 

mO 97 . pep LGTSSTTPYVESTAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 
I I t I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I M I I I I I I I I I I I 
a097 LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

iu097 . pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 
I I I M I I I I I I I I I t I I I I I I t I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I M i I I I I 
a097 QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 

370 380 390 400 410 420 

430 

mO 97 . pep VWIVAVLWALKFWYLGX 
I I I I I I I I i I I I I I I I I 
a097 VWIVAVLWALKFWYLGX 
430 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 345>: 

g098 . seq 

1 ATGACCGCCG ACGGTCTCTT CGTCGCTTTC AACTTCAATA CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAGCAGGA TGCTGCCCAG GCTGGCGATC 

101 AGTTTGTCX3G CGATGTCGCG CGCTTCGCTG TCGGGATGGC TTTCGCGTTC 

151 GGGATGAACG CAGCCGAGCA TGGACACGCC GGTACGCATC ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAAGTG TGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATTAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ED 346; ORF 098.ng>: 

g098.pep 

1 MTADGLFVAF NFNTFAWRI LIPVQQDAAQ AGDQFVGDVA RFAVGMAFAF 

51 GMNAAEHGHA GTHHVHRMGM CRQAFQNFT^H TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV CDFFKLAFLC QIRMS* 

The following partial DNA sequence was identified in M meningitidis <SEQ ID 347>: 

m098 . seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGGTGCCGAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the aniino acid sequence <SEQ ID 348; ORF 098>: 

m098 .pep 

1 MTADGLFVAF NLNAFAWRI LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAT^GFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 098 shows 89.6% identity over a 125 aa overlap with a predicted ORF (ORF 098.ng) 
from N. gonorrhoeae: 

m098/g098 

10 20 30 40 50 60 

MTADGLFVAFNLNAFAWRI LI PVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 

llllllllll|:hllllllllll|:||hlllllllllilh I I hi lllhlhl 
MTADGLFVAFNFNTFAWRI LI PVQQDAAQAGDQFVGDVARFAVGMAFAFGMNAAEHGHA 
10 20 30 40 50 60 

70 80 90 100 110 120 

GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

iihiiiiiiiiiiiiiiiiiiiiiiiiiiiMniiiiiiiiiMiiii lllllllll 

GTHHVHRMGMCRQAFQNFNHTDRQTJ^GFELGFISGQLEFVGQMAVNQQVCDFFKLAFLC 
70 80 90 100 110 120 



m098 .pep 
g098 

m098 .pep 
g098 



m098.pep QIRMSX 
llllll 

g098 QIRMSX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 349>: 

a098.seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 
51 TGTGCGT^ATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 
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101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 350; ORF 098.a>: 

a098 .pep 

1 MTADGLFVAF NLNAFA WRI LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 



m098/a098 100.0% identity in 125 aa overlap 

10 20 30 40 50 60 

m098 . pep MTADGLFVAFNLNAFAVVRILIPVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 
I I I I I I I I I I I I t i I I t 1 I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I M I I I I I I 1 I 
a098 MTADGLFVAFNLNAFAWRILIPVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 

10 20 30 40 50 60 



70 80 90 100 110 120 

mO 98 . pep GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 
I I I I I I I I I I I I I I 1 I I I I I I I M I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 
a098 GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

70 80 90 100 110 120 



m098.pep QIRMSX 
I I I I I I 

a098 QIRMSX 



The following partial DNA sequence was 


g099.seq 






1 


ATGCTGGGAC 


GCGCGTCCAT 


51 


GCTGACGGGC 


AAACGGCAGG 


101 


CACTGACCGA 


ATTCTTGCGT 


151 


TTTTTCGGCG 


AGGGCGCGAG 


201 


TTCCAACATG 


ACGCCGGAGT 


251 


ACGCGCAAAC 


TATTGATTAT 


301 


GTGAAATTGG 


TGGAAACCTA 


351 


CTTGAAAACC 


GCCGTTTATC 


401 


TAACGCGCAA 


TATGGCAGGC 


451 


GCCGATTTGG 


CGGCGAAAGG 


501 


CGGCCAAATG 


CCTGACGGTG 


551 


CCAATACTTC 


CAACCCGCGC 


601 


AATGCCAACC 


GCCTCGGCTT 


651 


TGCCCCGGGT 


TCAAAAGTAG 


701 


TGCCCGAAAT 


GGAAAAACTC 


751 


ACCTGTAACG 


GCATGAgcgG 


801 


CATCGACCGC 


GAtttgtacg 


851 


TCGACGGCCG 


TATCCATCCG 


901 


CCTTTGGTCG 


TTGCCTACGC 


951 


AAACGACGTA 


CTCGGCGTTG 


1001 


TCTGGCCGAC 


AGACGAAGAA 


1051 


CCGCAACAAT 


TCCGCGACAT 


1101 


GCAAAAAGCA 


CCAAGCCCGC 


1151 


TCCGCCGTCC 


GCCCTATTGG 


1201 


AGAGGTATGC 


GTCCGCCGGC 


1251 


CATCTCgcca 


tCCAATGCGA 



identified in N. gonorrhoeae <SEQ ID 35 1>: 

GATGCGCCTG CCCGATATTG TCGGCGTGGA 
CGGGCATTAC TGCCACAGAC ATCGTGTTGG 
AAAGAGCGCG TGGTCGGGGC GTTTGTCGAA 
AAGCCTGTCT ATCGGCGACC GCGCGACCAT 
TCGGCGCGAC TGCCGCCATG TTCGCCATCG 
TTGAAACTGA CCGGACGTGA CGACGCGCAG 
CGCCAAAACC GCAGGCTTAT GGGCAGGTGG 
CGCGCGTTTT G/U^TTTGAT TTGAGCAGCG 
CCGAGCAACC CGCACGCGCG TTTTGCCACC 
GCTGGCGAAG CCTTACGAAG AGCCTTCAGA 
CAGTGATTAT TGCCGCGATT ACTTCGTGTA 
AACGTTGTCG CCGCCGCACT GTTGGCACGC 
GAAACGCAAA CCTTGGGTGA AATCTTCGTT 
CCGGAATCTA TTTGAAAGAA GCAGGCTTGT 
GGCTTCGGTA TCGTCGCCTT CGCATGTACC 
CGCGCTcgaC CCGAAAATCC AACAAGAAAT 
CCACCGCCGT ATTGTCAGGC AACCGCAACT 
TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
ATTGGCAGGT AGCATCCGTT TCGATATTGA 
CAGACGGCCG CGAAATCCGC CTGAAAGATA 
ATCGATGCCA TCGTTGCCGA ATATGTGAAA 
TTATATCCCG ATGTCCGACA CCGGCACAGC 
TGTACGACTG GCGACCGATG TCCACCTACA 
GAAGGCGCAC TGGCAGGGGA ACGTACATTA 
GATTTTGCCC GACAACATCA CCACCGACCA 
TTTTGGCCGG cagTGCcgca ggtgaATATT 
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1301 TGGCGAAAAT GGGTTTGCCT GAAGAagaCT TCAACTCTTA CGCAACCCAC 

13 51 CGCGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 
1401 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTtcgt 
1451 tggcacgcgT tgaacCAGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTT ATCATCATTG CCGGTGCGGA 
1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC GGCGAAGGGC GTGCGGCTGG 
1601 CGGGTGTGGA AGCCATCGCC GCCGAAGGTT TCGAGCGCAT CCACCGCACC 
1651 AACCTCATCG GCATGGGCGT CTTGCCGCTG CAATTCAAAC CCGGCACCAA 
1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 
1751 AACGCACACC GCGCTGCGGC CTGACCCTCG TGATTCACCG TAAAAACGGA 
1801 GAAACCGTCG AAGTTCCGGT TACCTGCCGC CCCGATACCG CAGAAGAAGC 
1851 ATTGGTATAT GAAGCCGGCG GCGTATTGCA ACGGTTTGCA CAGGACTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 352; ORF 099.ng>: 

g099 .pep 

1 MLGRASMMRL PDIVGVELTG KRQAGITATD IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDAQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWAGGLKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADIiAAKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NWAAALLAR 

201 NANRLGLKRK PWVKSSFAPG SKVAGIYLKE AGLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQQEIIDR DLYATAVLSG NRNFDGRIHP YAKQ AFLASP 

301 PLWAYALAG SI RFDIENDV LGVADGREIR LKDIWPTDEE IDAIVAEYVK 

3 51 PQQFRDIYIP MSDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

401 RGMRPPAILP DNITTDHISP SNAILAGSAA GEYLAKMGLP EEDFNSYATH 

451 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI 

501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIA AEGFERIHRT 

551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCG LTLVIHRKNG 

601 ETVEVPVTCR PDTAEEALVY EAGGVLQRFA QDFLEGNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 353>: 

m099 . seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGCAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

251 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

351 CTTGAAAACC GCCGTTTATC CTCGCGTTTT GAAATTTGAT TTGAGCAGCG 

401 TAACGCGCAA TATGGCAGGC CCAAGTAACC CGCATGCCCG TTTTGCGACC 

451 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCGGA 

501 CGGCCAAATG CCCGACGGCT CGGTCATCAT CGCCGCGATT ACCAGTTGCA 

551 CCAACACTTC CAACCCGCGC AACGTTGTTG CCGCCGCGCT CTTGGCACGC 

601 AATGCCAACC GTCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT 

651 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCGGGCCTGT 

701 TGCCCGAT^T GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCCTGCACC 

751 ACCTGCAACG GCATGAGTGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT 

801 CATCGACCGC GATTTGTACG CCACCGCCGT ATTATCAGGC AACCGCAACT 

851 TCGACGGCCG TATCCACCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 

901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGTATCCGTT TCGATATTGA 

951 AAACGACGTA CTCGGCGTTG CAGACGGCAA GGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTGC CGATGAAGAA ATCGATGCCG TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAGT TCCGCGATGT GTATGTACCG ATGTTCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCCAGTCCGC TGTACGATTG GCGTCCGATG TCCACCTACA 

1151 TCCGCCGTCC GCCTTACTGG GAAGGCGCGC TGGCAGGGGA ACGCACATTA 

1201 AGAGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 

1251 CCTCTCGCCG TCCAATGCGA TTTTGGCCGT CAGTGCCGCA GGCGAGTATT 

1301 TGGCGAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 

1351 CGCGGCGACC ACTTGACCGC CCAACGCGCT ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAAAA ACGAAGACGG CAGCGTGCGC CAAGGCTCGT 
14 51 TCGCCCGCGT CGAACCCGAA GGCGAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC ATCATCATTG CCGGTGCGGA 
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1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 

1601 CCGGCGTAGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 

1651 AACCTTATCG GCATGGGCGT GTTGCCGCTG CAGTTCAAAC CCGACACCAA 

1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTGGTCGGCG 

1751 AACGCACACC GCGCTGCGAC CTGACCCTCG TGATTCACCG TAAAAACGGC 

1801 GAAACCGTTG AAGTTCCCGT TACCTGCTGC CTCGATACTG CAGAAGAAGT 

1851 ATTGGTATAT GAAGCCGGCG GCGTGTTGCA ACGGTTTGCA CAGGATTTTT 

1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 354; ORF 099>: 

m099 .pep 

1 MLGRASMMRL PDIVGVELNG KRQAGITATD IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTR^fMAG PSKTPHARFAT 

151 ADLAAKGLAK PYEEPSDGQM .PDGSVIIAAI TSCTNTSNPR NWAAALLAR 

201 NANRLGLKRK PWVKSSFAPG SKVAEIYLKE AGLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQ AFLA5P 

301 PLWAYALAG SI RFDIENDV LGVADGKEIR LKDIWPADEE IDAWAEYVK 

351 PQQFRDVYVP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGAIiAGERTL 

401 RGMRPLAILP DNITTDHLSP SNAILAVSAA GEYLAKMGLP EEDFNSYATH 

451 RGDHLTAQRA TFANPKLFNE MVKNEDGSVR QGSFARVEPE GETMRMWEAI 

501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT 

551 NLIGMGVLPL QFKPDTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 

601 ETVEVPVTCC LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 099 shows 96.2% identity over a 639 aa overlap with a predicted ORF (ORF 099.ng) 
from N. gonorrhoeae: 

m099/g099 

10 20 30 40 50 60 

MLGRASMMRLPDIVGVELNGKRQAGITATDIVLALTEFLRKERWGAFVEFFGEGARSLS 

iiiiiiiiiiiiiiiiihiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiii 

MLGRASMMRLPDIVGVELTGKRQAGITATDIVLALTEFLRKERWGAFVEFFGEGARSLS 
10 20 30 40 50 60 

70 80 90 100 110 120 

IGDRAT I SNMTPEFGATAAMFAI DEQT I DYLKLTGRDDAQVKLVET YAKTAGLWADALKT 

IIIIIMIIIIIIIIIIIIIIIII IIIIIIIIIIIIIIIIIIIIIIIMIIIII :||| 
XGDRATISNMTPEFGATAAMFAIDAQTIDYLKLTGRDDAQVKLVETYAKTAGLWAGGLKT 
70 80 90 100 110 120 

130 140 150 160 170 180 

AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKP YEE PSDGQMPDGS VI I AAI 

illllllllllllllllllllllllllllllllllllllillllllllllllhllllli 

AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGAVIIAAI 
130 140 150 160 170 180 

190 200 210 220 230 240 

TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAEIYLKEAGLLPEMEKL 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIII lllllllllllllll 

TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAGIYIiKEAGLLPEMEKL 
190 200 210 220 230 240 

250 260 270 280 290 300 

GFGIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNRNFDGRIHPyAKQAFLASP 

llllllliillllllllllllllhll Illlllllllllllllllllllllllll 

GFGIVAFACTTCNGMSGALDPKIQQEIIDRDLYATAVLSGNRNFDGRIHPYAKQAPLASP 
250 260 270 280 290 300 

310 320 330 340 350 360 



Tn099 .pep 
g099 

m099 . pep 
g099 

m099 .pep 
g099 

m099 .pep 
g099 

m099 .pep 
g099 



wo 99/57280 




PCT/US99/09346 



Tn099 . pep PLWAYALAGSIRFDIENDVLGVADGKEIRLKDIWPADEEIDAWAEYVKPQQFRDVYVP 

IllllllllllllllllilllllllhllllllllhlllllhlllllllllllhlU 
g099 PLWAYALAGSIRFDIENDVLGVADGREIRLKDIWPTDEEIDAIVAEYVKPQQFRDIYIP 
310 320 330 340 350 360 



370 380 390 400 410 420 

m099 . pep MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRO^RPLAILPDNITTDHLSP 

I lllllllllllllllllllllllllllllllllllllllllll llllllllllhll 
g099 MSDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPPAILPDNITTDHISP 
370 380 390 400 410 420 



430 440 450 460 470 480 

mO 99 . pep SNAILAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 

mill iiiiiiiiiiiiiiiiiiiiiiiiiiMiiiiiiiiiiiiiiiihiiiiiii 

g 0 9 9 SNAI IiAGS AAGEYLAKMGLPEEDFNS YATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 

430 440 450 460 470 480 



490 500 510 520 530 540 

m099 . pep QGSFARVEPEGETMRMWEAIETYMNRKQPLIIIAGADYGQGSSRDWAAKGVRLAGVEAIV 

llhllllllhlllllllllllllllllllllllllllllllllllllllllllllll: 
g 0 9 9 QGSLARVEPEGQTMRMWEAI ETYMNRKQPLI 1 1 AGADYGQGSSRDWAAKGVRLAGVEAI A 

490 500 510 520 530 540 



550 560 570 580 590 600 

mO 9 9 . pep AEGFERIHRTNLIC»lGVLPLQFKPDTNRHTLQLDGTETYDWGERTPRCDIiTLVIHRKNG 
llllllllllllllllllllllll llllllllllllllllllllllll llllllllll 
g099 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQIiDGTETYDWGERTPRCGLTLVIHRKNG 
550 560 570 580 590 , 600 



610 620 630 640 

mO 9 9 . pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

lllllllll {lllhlllllllllllMIIIIIIIIII 

g 0 9 9 ETVEVPVTCRPDTAEEALVYEAGGVLQRFAQDFLEGNAAX 
610 620 630 640 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 35 5>: 
a099.seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGAAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

251 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

351 CTTGAAAACC GCCGTTTATC CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 

401 TAACGCGCAA TATGGCAGGC CCGAGCAACC CGCACGCGCG TTTTGCGACC 

451 GCCGATTTGG CCGGCAAAGG CTTGGCTAAA CCTTACGAAG AGCCTTCAGA 

501 CGGCCAAATG CCTGACGGTG CAGTGATTAT TGCCGCGATT ACTTCCTGTA 

551 CCAATACTTC CAATCCGCGC AACGTTGTCG CCGCCGCGCT GTTGGCACGC 

601 AATGCCAACC GCCTCGGCTT GCAACGCAAA CCTTGGGTGA AATCTTCGTT 

651 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCAGATCTGC 

701 TGCCCGAAAT GGT^AAAACTC GGCTTCGGTA TCGTTGCCTT CGCATGTACC 

751 ACCTGTAACG GCATGAGCGG CGCGCTGGAT CCGAAAATCC AG/W^GAAAT 

801 CATCGACCGC GATTTGTACG CCACCGCCGT ATTGTCAGGC AACCGCAACT 

851 TTGACGGCCG TATCCATCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 

901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGCATCCGTT TCGATATTGA 

951 AAACGACGTA CTCGGCGTTG CAGACGGCAA AGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTAC CGATGAAGAA ATCGATGCCA TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAAT TTCGCGACGT TTATATCCCG ATGTTCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCAAGCCCGC TGTACGACTG GCGTCCAATG TCTACCTATA 

1151 TCCGCCGCCC ACCTTACTGG GAAGGCGCAC TGGCAGGGGA ACGCACATTA 

1201 AGCGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 
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1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 



TCTCTCGCCA 
TGGCAAAAAT 
CGTGGCGACC 
GTTTAACGAA 
TGGCACGCGT 
GAAACCTATA 



TCCAATGCGA TTTTGGCAAG CAGTGCCGCA 
GGGTTTGCCT GAAGAAGACT TCAACTCTTA 
ACTTGACCGC CCAACGCGCA ACCTTCGCCA 
ATGGTGAGAA ACGAAGACGG CAGCGTACGC 



TGAACCCGAA GGCCAAACCA TGCGCATGTG 
TGAACCGCAA ACAGCCGCTC ATCATCATTG 
GCGACTGGGC TGCAAAAGGC 
GCCGAAGGCT TCGAGCGTAT 
GTATGGGCGT GTTGCCGCTG CAGTTCAAAC 
CTGCAACTGG ACGGTACGGA AACCTACGAC 
GCGCTGCGAC CTGACCCTTG TGATTCACCG 
AAGTCCCCAT TACCTGCCGC CTCGATACCG 
GAAGCCGGTG GCGTATTGCA ACGGTTTGCA 
cdCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 356; ORF 

a099.pep 

PDIVGVELNG 
IGDRATISNM 
AGLWADALKT 
PYEEPSDGQM 
PWVKSSFAPG 



CTACGGTCAA GGCTCAAGCC 
CCGGCGTGGA AGCGATTGTT 
AACTTGATCG 
CCGCCACACC 
AACGCACACC 
GAGACCGTCG 
GTTGGTATAT 
TGGAAGGGAA 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 



KRKAGITATD 
TPEFGATAAM 
AVYPRVLKFD 
PDGAVIIAAI 
SKVAEIYLKE 



IVLALTEFLR 
FAIDEQTIDY 
LSSVTRNMAG 
TSCTNTSNPR 
ADLLPEMEKL 
NRNFDGRIHP 

LKDIWPTDEE 

PQQFRDVYIP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW 
SGMRPLAILP DNITTDHLSP SNAILASSAA GEYLAKMGLP 
RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE 
ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIV 
NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCD 
ETVEVPITCR LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 



MLGRASMMRL 
FFGEGARSLS 
VKLVETYAKT 
ADLAGKGLAK 
NANRLGLQRK 
TCNGMSGALD 
PLWAYALAG 



PKIQKEIIDR DLYATAVLSG 
SIRFDIENDV LGVADGKEIR 



GGCGAATATT 
CGCAACCCAC 
ATCCGAAACT 
CAAGGTTCGC 
GGAAGCCATC 
CCGGCGCGGA 
GTACGCCTCG 
CCACCGCACC 
CGGGTACCAA 
GTTGTCGGCG 
TAAAAACGGC 
CAGAAGAAGT 
CAGGATTTTT 

099.a>: 

KERWGAFVE 
LKLTGRDDAQ 
PSNPHARFAT 
NVVAAALLAR 
GFGIVAFACT 
YAKQAFLASP 
IDAIVAEYVK 
EGALAGERTL 
EEDFNSYATH 
GQTMRMWEAI 
AEGFERIHRT 
LTLVIHRKNG 



m099/a099 97.5% identity in 639 aa overlap 

10 20 30 40 50 60 

mO 99 . pep MLGRASMMRLPDIVGVELNGKRQAGITATDIVLALTEFLRKERWGAFVE FFGEGARSLS 

I I I I I t I I 1 I 1 I I I I i i I I I I I : I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
a099 MLGRASMMRLPDIVGVELNGKRKAGITATDIVLALTEFLRKERWGAFVEFFGEGARSLS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m099.pep IGDRATISNMTPEFGATAAMFAIDEQTIDYLKLTGRDDAQVKLVETYAKTAGLWADALKT 
I I I I I I M I I I I I I I I i I M I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 
a 0 9 9 I GDRAT I SNMT PE FG ATAAMFAI DEQT I D YLKLTGRDDAQVKLVET YAKTAGLWADALKT 

70 80 90 100 110 120 

130 140 150 160 170 180 

ni099.pep AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVIIAAI 
lllillllllllllltllllllllllllllli|[:||||il||||||MIII|:|||||| 
a099 AVYPRVLKFDLSSVTRNMAGPSNPHT^FATADLAGKGLAKPYEEPSDGQMPDGAVIIAAI 

130 140 150 160 170 180 

190 200 210 220 230 240 

m099.pep TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAEIYLKEAGLLPEMEKL 
M I I I I I t I I I I i I I I I I I I I I I I I I I : I I I I I I I [ I I I I I i I I I I I I I I I I I I I I I I I 
a099 TSCTNTSNPRNWAAALLARNANRLGLQRKPWVKSSFAPGSKVAEIYLKEADLLPEMEKL 

190 200 210 220 230 240 

250 260 270 280 290 300 

mO 99 . pep GFGIVAFACTTCNGMSGALDPKIQKEI IDRDLYATAVLSGNRNFDGRIHPYAKQAFLASP 

I I I I ! I I I i I I I I I M I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I i I I I i I I I I I I I 
a099 GFGIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASP 

250 260 270 280 290 300 



m099.pep 



310 320 330 340 350 360 

PLWAYALAGSIRFDIENDVLGVADGKEIRLKDIWPADEEIDAWAEYVKPQQFRDVYVP 
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I I I I i I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I : I I I I I I : I I I i I I I I I i t I I I : I 
a099 PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPTDEEIDAIVAEYVKPQQFRDVYIP 
310 320 330 340 350 360 

370 380 390 400 410 420 

m099.pep MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPLAILPDNITTDHLSP 
I I I t I I I I i I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
a099 MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLSGMRPLAILPDNITTDHLSP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m099 . pep SNAILAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 
I I I I I I I I I I I I I I M I I I I I I i I I I I I I I I I I I I I I I I j I I I I I I I I I I I : I I I I I I I 
a099 SNAILASSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 

430 440 450 460 470 480 

490 500 510 520 530 540 

m099.pep QGSFARVEPEGETMRMWEAIETYMNRKQPLIIIAGADYGQGSSRDWAAKGVRLAGVEAIV 
I I I : I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I t I I I I I I I I I f I I I I 
a099 QGSLARVEPEGQTMRMWEAIETYMNRKQPLIIIAGADYGQGSSRDWAT^GVRLAGVEAIV 

490 500 510 520 530 540 

550 560 570 580 590 600 

m099 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDWGERTPRCDLTLVIHRKNG 
I I I 1 I I M I I I I I I I I I I I I t I I i I I I i I 1 I t I I I I I I I I I I I I I i I I I I I I I I I I I I j 
a099 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDVVGERTPRCDLTLVIHRKNG 

550 560 570 580 590 600 

610 620 630 640 

mO 9 9 . pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 
111111:11 I I I I I I I I i I I I I I I I i I I I i I I I I I I I I I 
a099 ETVEVPITCRLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

610 620 630 640 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 357>; 

gl02 . seq 

1 AtgtCCGCCA AAactccgtc gctcttcggc ggcgcgatga Ttatcgccgg 

51 gaaggttatc ggcgcAGgta tgttccccaa ccccaccgcc aacttggggg 

101 acgggttaat aggctcgctg attgtgctgc tgtacacctg gtttccattc 

151 tcctccggcg ccctcatgat tttggaagtc aacacccata acCCccgagg 

201 ggcaAGtttt gacaccATGg tcAAagacct gctcgGaCGc ggctggaaca 

251 tcatcaacgg catcgccgtc gctttggTCc tatacggctc gacctacgcg 

301 tacattttag tcggcggtga cctGACCGCC AAAGGCAtcg GCAgCGCAGT 

351 AGGCGGCAAA ATTTCgctca CCGTCGGACA actcgtcttc tTCGGCATCC 

401 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTTACCGGC 

451 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GGTTGCCGAT GCCAAACCGT CCGTCCTCTT CGACACCCAA GCCCCCGTCG 

551 GCACCGGCTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 

601 TCCTTCGGCT TCCACGGCAA CGTTTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGcgcCc aaagtGgCGA aATCcatctg gGcaggtaca ttggTTGCCt 

701 tggtaattta cgtccTCTgg caaaccgcca tCcaaagcaa ccTGCcgcgc 

751 aacgagttcg cCCCcgtgat tgccgccgag aggcaactCT CCGTCCTgaa 

801 tgaaacccTG tccaaattcg cccaaaccgg cgatatggat aAaatattgt 

851 ccctatttcc ctacatggca atcgccacct cctttttagg cgTAACctta 

901 ggcctgtttg acaacatcgc cgacatcttc aaatggaacg acagtatgtc 

951 cgggcggggc accaaaaccg tcgcgctgaa cttcctgccg CCCCtgattt 

1001 cctggctgct cctccccacc ggcttcttta ccgccattgg tgcgtccggc 

1051 ctggcggcaa ccgtctggga ccaagGcatc atccccgcca tgctgctcta 

1101 cgtttccccc caaaaaattG gcGcaggcaa gacttataAa gtttaCGGCG 

1151 gcttgtggct gatgttagtc ttccttttcg gcatcgccaa catcgccgca 

1201 CAGGTATTGA GccaAatgGa ACtcgtCccc GTATTTAAAG GATAA 



This corresponds to the amino acid sequence <SEQ ID 358; ORF 102.ng>: 

gl02.pep 



wo 99/57280 




PCTAJS99/09346 



1 

51 
101 
151 
201 
251 
301 
351 
401 



MSAKTPSLFG GAMIIAGKVI GAGMFPNPTA NLGDGLIGSL IVLLYTWFPF 



SSGA LMILEV 
YILVGGDLTA 
VLIGGMVLTF 



NTHNPRGASF 
KGIGSAVGGK 
IWATGGLVAD 



SFGFHGNVSS 
NEFAPVIAAE 
GLFDNIADIF 
LAATVWDQGI 
QVLSQMELVP 



LLKYFKGDAP 
RQLSVLNETL 
KWNDSMSGRG 
IPAMLLYVSP 
VFKG* 



DTMVKDLLGR GWNIING IAV ALVLYGSTYA 
ISLTVGQLVF FGILAFC VWA SARLVDRFTG 
AKPSVLFDTQ APVGTGYWIY AATALPVCLA 
KVAKSIW AGT LVALVIYVLW QTAI QSNLPR 
SKFAQTGDMD K ILSLFPYMA lATSFLGV TL 
TKTVALNFLP P LISWLLLPT GFFTAIGA SG 
QKIGAGKTYK VYGGLWLMLV FLFGIANIAA 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 359>: 



ml02 .seq 

1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 



ATGCCCAACA 
CACGGTCATC 
TATGGTTTAC 
CTTTCCAGCG 
CGCAAGTTTC 
TCATCAACGG 
TATATCTTCG 
AGGCGGCGAC 
TCGCCTTTTG 
GTCCTTATCG 
GATTGCCGAT 
GCACAAACTA 
TCCTTCGGCT 
CGACGCGCCC 
TGGTAATTTA 
AACGAGTTCG 
CGAAACCCTG 
CCCTGTTTTC 
GGACTCTTCG 
CGGCCGCACC 
GCCTGCTCTT 
GCGGCAACCG 
GCGCAAAAAA 
GGCTGATGGT 
GTATTGAGCC 



AAACCCCTTC 
GGCGCAGGCA 
CGGCTCGCTG 
GCCTGATGAT 
GACACGATGG 
CATCGCCGTC 
TCGGCGGCGA 
GTTTCACTCA 
CGTATGGGCA 
GCGGCATGGT 
GCCAAGCCGT 
CTGGATTTAC 
TCCACGGCAA 
AAAGTGGCTA 
CGTCCTCTGG 
CCCCCGTCAT 
TCCAAATTCG 
CTATATGGCG 
ACTACATCGC 
AAAACCGCCG 
CCCCACCGGC 
TCTGGACAGG 
TTCGGCGCAG 
TTGGGTCTTC 
AAATGGAACT 



ACTGTTCGGC 
TGCTCGCCAA 
GCCGTGTTGC 
TTTGGAAGTC 
TCAAAGACCT' 
GCCTTCGTTT 
CCTGACCGCC 
CCGTCGGACA 
TCCGCACGCT 
ATTGACCTTT 
CCGTCCTCTT 
GCCGCCACCG 
CGTCTCCAGC 
AATCCATCTG 
CAAACCGCCA 
CGCCGCCGAA 
CCCAAACCGG 
ATCGCCACCT 
CGACATCTTC 
CGCTGACCTT 
TTCGTTACCG 
CATCATCCCC 
GCAAAACCTA 
CTTTTCGGCA 
CGTCCCCGTA 



GGCGCGATGA 
CCCGACCGCC 
TGTACACCTG 
AACACCCATT 
GCTCGGACGC 
TATACCTGCT 
AAAGGCTTAG 
ACTCGTCTTC 
TGGTCGACCG 
ATTTGGGCGG 
CGATACCCAA 
CCCTGCCCGT 
CTGCTCAAAT 
GACGGGCACA 
TCCAAGGCAA 
GGGCAAGTCT 
CAATATGGAC 
CGTTTTTAGG 
AAATGGAACG 
CCTGCCGCCC 
CCATCGGCTA 
GCCATGCTGC 
TAAAGTTTAC 
TCGTCAACAT 
TTTAAAGGAT 



TTATCGCCGG 
ACATCCGGCG 
GTTTTCTATG 
ATCCGCACGG 
GGCTGGAACA 
TACTTACGCT 
GCAGCGCGGC 
TTCGGCATCC 
CTTCACCGGC 
CCGGCGGGCT 
GCCCCCGCCG 
CTGCCTCGCT 
ACTTTAAAGG 
CTGATTGCGC 
CCTGCCGCGC 
CCGTCCTCAT 
AAAATATTGT 
CGTAACGCTC 
ACAGCATCTC 
CTGATTTCCT 
CGTCGGCCTG 
TCTACCGTTC 
GGCGGCTTGT 
CGCCGCACAG 
AA 



This corresponds to the amino acid sequence <SEQ ID 360; ORF 102>: 

iiil02.pGp. . 

1 MPNKTPSLFG GAMIIAGTVI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM 



51 LSSGLMILEV 
101 YIFVGGDLTA 
151 VLIGGMVLTF 



NTHYPHGASF 
KGLGSAAGGD 
IWAAGGLIAD 



201 SFGFHGNVSS 

251 NEFAPVIAAE 

301 GLFDYIADIF 

351 AATVWTGII P 

401 VLSQMELVPV 



LLKYFKGDAP 
GQVSVLIETL 
KWNDSISGRT 
AMLLYRSRKK 
FKG* 



DTMVKDLLGR GWNII NGIAV AFVLYLLTYA 
VSLTVGQLVF FGILAFC VWA SARLVDRFTG 
AKPSVLFDTQ APAGT NYWIY AATALPVCLA 
KVAKS IWTGT LIALVIYVLW QT AIQGNLPR 
SKFAQTG NMD KILSLFSYMA lATS FLGVTL 
KTAALTFLPP LISCLL FPTG FV TAIGYVGL 
FGAGKTYKV Y GGLWLMVWVF LFGIVN IAAQ 



Iia02/gl02 86.0% identity in 415 aa overlap 

10 20 30 40 50 60 

ml 02. pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 
I IIIIIIIIMIIIi illlil: MM: I : I I I I I I I I I I : I :: I I I I t I 
gl02 MSAKTPSLFGGAMIIAGKVIGAGMFPNPTANLGDGLIGSLIVLLYTWFPFSSGALMILEV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 02. pep NTHYPHGASFDTMVKDLLGRGWNIINGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 

III I : I I I I I I I I I t I I I I I I II I I I I I I I : I I I I I I II : I I I I I I II I : I II : I I 
gl02 NTHNPRGASFDTMVKDLLGRGWNIINGIAVALVLYGSTYAYILVGGDLTAKGIGSAVGGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml02 . pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
:||lllllltlllllllllllllllllllllllllllllllll:|||:|||||||||||| 
gl02 ISLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWATGGLVADAKPSVLFDTQ 

130 140 150 160 170 180 



ml 02 .pep 



190 200 210 220 230 240 

APAGTNYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 



wo 99/57280 
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I I : I I : I I I i I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I M I : I I I : I I I I I I I I 
gl02 APVGTGYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWAGTLVALVIYVLW 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml 02 . pep QTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
I I I I I : I I I I I I I I I I I I I I I : I I I I I I I I I I I I I : I I i I I I I I I I I I I I I I I I I I I 
gl02 QTAIQSNLPRNEFAPVIAAERQLSVLNETLSKFAQTGDMDKILSLFPYMAIATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 

ml02 . pep GLFDYIADIFKWNDSISGR-TKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVWT-GI 

I I I I 1111111111:111 I I I : I I : I I I I I I I I I : I I I I I I I I I I I I I I I II 
gl02 GLFDNIADIFKWNDSMSGRGTKTVALNFLPPLISWLLLPTGFFTAIGASGLAATVWDQGI 

310 320 330 340 350 360 

360 370 380 -390 400 410 

ml 02 .pep IPAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIVNIAAQVLSQMELVPVFKGX 

II I II I I I : I : I I I I I I i I I I I I I I I : I I I I II : I I I I I I II I I I I II I I I I I 
gl02 IPAMLLYVSPQKIGAGKTYKVYGGLWLML-VFLFGIANIAAQVLSQMELVPVFKGX 

370 380 390 400 410 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 36 1>: 

al02.seq 

1 ATGCCCACCA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 

51 CACGNTCATC GGCGCAGGTA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC TGTACACCTG GTTTTCCATG 

151 CTCTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCACT ACCCCCACGG 

201 CGCGANCTTC GACACCATGG TTAAAGACCT GCTCGGACGG AGCTGGAACA 

251 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC AAAGGCTTAG GCAGCGCGGC 

351 AGGCGGCAAT GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATTC 

401 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG ATTCACCAGC 

4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GATTGCCGAT GCCAAACTGC CCGTCCTCTT CGACACCCAA GCCCCTACCG 

551 GCACCAACTA CTGGATTTAT GTCGCCACCG CCCTGCCCGT CTGCCTTGCG 

601 TCATTCGGTT TCCACGGCAA CGTCTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGCGCCC AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

701 TGGTAATTTA CGTCCTCTGG CAAACCGCCA TCCAANGCAA CCTGCCGCGC 

751 AACGAGTTCG CCCCCGTGAT TGCCGCCGAA GGGCAAGTCT CCGTCNTGAT 

801 TGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC AAAATATTGT 

851 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 

901 GGACTCTTCG ACTACATCGC CGACATCTTC AAATGGAACG ACAGCGTGTC 

951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCT NTAATTTCCT 

1001 GCCTGCTCTT CCCCACCGGC TTTGTTACCG CCATCGGNTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TNTACCGTTC 

1101 GCGCAAAAT^ TTCGGCGCAG GCAAAACCTA TAAAGTTTAC GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCNTCAACAT CGCCGCACAN 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA TTTAAAGGAT AA 
1202 

This corresponds to the amino acid sequence <SEQ ID 362; ORF 102.a>: 

al02 .pep 

1 MPTKTPSLFG GAMIIAGTXI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM 

51 LSSGLMILEV NTHYPHGAXF DTMVKDLLGR SWNIINGIAV AFVLYLLTYA 

101 YIFVGGDLTA KGLGSAAGGN VSLTVGQLVF FGILAFCVWA SARLVDRFTS 

151 VLIGGMVLTF IWATGGLIAD AKLPVLFDTQ APTGTNYWIY VATALPVCLA 

201 SFGFHGNVSS LLKYFKGDAP KVAKSIWTGT LIALVIYVLW QTAIQXNLPR 

251 NEFAPVIAAE GQVSVXIETL SKFAQTGNMD KILSLFSYMA lATSFLGVTL 

301 GLFDYIADIF KWNDSVSGRT KTAALTFLPP XISCLLFPTG FVTAIGYVGL 

351 AATVWTGIIP AMLLYRSRKK FGAGKTYKVY GGLWLMVWVF LFGIXNIAAX 

4 01 VLSQMELVPV FKG* 



ml02 / al02 



95.9% identity in 413 aa overlap 
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312 



10 20 30 40 50 60 

inl02 . pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 
I I: I I I I I t I I I I I I M I I I I I I I I I I I I I I I I I I I I M I i I I I I I i I I M I I I I I I i I 
al02 MPTKTPSLFGGAMIIAGTXIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 

10 20 30 40 50 60 



70 80 90 100 110 120 

inl02 . pep NTHYPHGASFDTMVKDLLGRGWNIINGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 
I I I I I I I i I I I I I M I I I I : I I I I I I I I I i I I I i I I I I I I I I I I I I I I I I I I I I I I I I : 
al02 NTHYPHGAXFDTMVKDLLGRSWNIINGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGN 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml02 . pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
I I t I I I I I I I I I I I I I I I I I I I I t I I I I I : I I I I I I M I I I I I : I I I i I I I i I I I I I I 
al02 VSLTVGQLVFFGILAFCVWASARLVDRFTSVLIGGMVLTFIWATGGLIADAKLPVLFDTQ 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml02 . pep APAGTNYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 
I I : I I i I I I I : i I t I M I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I 
al02 APTGTNYWIYVATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml02 . pep QTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
I 1 I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I i I I I I i I I i I I I I I I I I I I t I I I I I I I I 
al02 QTAIQXNLPRNEFAPVIAAEGQVSVXIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml02 . pep GLFDYIADIFKWNDSISGRTKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVWTGIIP 
I I I I I I I I I i I t I I I : I I I I I I I I I I I t I I I I I I i I I t I i I I I I I ] I I I I I I I I I I I I I 
al02 GLFDYIADIFKWNDSVSGRTKTAALTFLPPXISCLLFPTGFVTAIGYVGLAATVWTGIIP 

310 320 330 340 350 360 



370 380 390 400 410 

ml 02 . pep AMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIVNI AAQVLSQMELVPVFKGX 

I t I I I M I I I I I I I I I I I I I I 1 t I I I i I I I I I I ) I I I I I I I I I I M I t I I I I 
al02 AMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIXNIAAXVLSQMELVPVFKGX 

370 380 390 400 410 



The following partial DNA sequence 



gl05 . seq 






1 


Atgtccgcag 


aaaCATACAc 


51 


gGgtctgcct 


atgGTAACGC 


101 


TATACAACCG 


CTCGCCCGAC 


151 


AAAGTTTACG 


GCagcACCGC 


201 


CCTGATGGTT 


TCCGACTATG 


251 


GCGACGGATT 


GGCCGGCAAA 


301 


ACCGAAAACC 


TCGCCGTCAA 


351 


TGCCGAAGCA 


CCCGTTTCCG 


401 


TGCTGATTCT 


GTTCGGCGGC 


451 


ATATTTTCCC 


TTGTCGGCAA 


501 


AGGCTCGGGC 


GCGAAACTCG 


551 


AAGCGTACAG 


CGAAGCGATG 


601 


GACACCATCG 


TCGAAGCCAT 


651 


TCAAACAAAA 


AAATCACTAT 


701 


CACTCAAACA 


CGCTTCCAAA 


751 


CAGGCAGGCA 


ACACCCTGCC 


801 


CAAAGCAGTT 


GAAGCCGGCT 


851 


TGAAATTGGC 


AGAACACTGA 



identified in N. gonorrhoeae <SEQ ID 363>: 

acAAAtcggc tGGgtaggct taggGcaaat 
GGCTCTTGGA CXSGCGGCATC GAAGTCGGCG 
AAAACTGCCC CCATCTCcgc CAAAGGAGCA 
CGAACTCGTC CGCGCCTGCC CCGTCATTTT 
CCGCCGTGTG CGACATCCTG AACGGAGTCC 
ATCATCGTCA ACATGAGCAC CATCTCCCCG 
AGCACTTGTC GAAGCCGCAG GCGGACAGTT 
GATCGGTCGG ACCCGCCACC AACGGCACAC 
AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 
AAAAACCTTC CATTTCGGCG ATGTCGGCAA 
TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 
CTGATGGCGC GGCAGTTCGG CATCGATACC 
CGGCGGCTCG GCAATGGACT CGCCTATGTT 
GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 
GACCtTAACC TCGccgtcAA AGAGCTTGAA 
CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 
ACGGCGAACA GGACGTTTCC GGCGTTTACC 
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This corresponds to the amino acid sequence <SEQ ID 364; ORF 105 .ng>: 

glOS.pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RAC PVIFIW SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 365>: 



ml05 . seq 












1 


ATGTCCGCAA 


ACGAATACGC 


ACAAATCGGC 


TGGaTAGGCT 


TAGGGCAAAT 


51 


GGGTCTGCCT 


ATGGTAACGC 


GGCTCTTGGA 


CGGCGGCATC 


GAAGTCGGCG 


101 


TATACAACCG 


CTCGCCCGAC 


AAAACTGCCC 


CCATCTCCGC 


CAAAGGCGCA 


151 


AAAGTTTACG 


GCAACACCGC 


-CGAACTCGTC 


CGCGACTATC 


CCGTCATTTT 


201 


CCTGATGGTT 


TCCGACTATG 


CCGCCGTGTG 


CGACATCCTG 


AACGGAGTCC 


251 


GCGACGGATT 


GGCCGGCAAm 


ATCATCGTCA 


ACATGAGCAC 


CATCTCCCCG 


301 


ACCGAAAaGC 


TCGCCGTCAA 


AGCACTTGTC 


GAAGCGCAGm 


GaCAGTTTGC 


351 


CGAAGCACCC 


GTTTCCGGAT 


CGGTCGGGCC 


CGCCACCAAC 


GGCACGCTGC 


401 


TGATTCTGTT 


CGGCGGCAGC 


GAAcCGtTTT 


AAACCCGCTG 


CAAAAAATAT 


451 


TTTCCCTCGT 


CGGCAAAAAA 


ACCTTCCATT 


TCGGCGATGT 


CGGCAAAGGT 


501 


TCGGGCGCGA 


AACTCGTCTT 


GAACTCGCTC 


TTGGGCATTT 


TCGGCGAaCG 


551 


TACAGCGAAS 


GmTgCTGATG 


GCGCGGCAGT 


TCGGCATCGA 


TACCGACACC 


601 


ATCGTCGAAG 


CCATCGGsGA 


CTCGGCAATG 


GACTCGCCCA 


TGTTCCAAAC 


651 


CAAAAAATCC 


CTGTGGGCAA 


ACCGCGAATT 


CCCGmCCGmC 


TTCGCCCTCA 


701 


AACACGCCTC 


CAAAGACCTC 


AACCTCGCCG 


TCAAAGAGCT 


TGAACAGGCA 


751 


GGCAACACCC 


TGCCCGCCGT 


CGAAACCGTT 


GCTGCCAGCT 


ACCGCAAAGC 


801 


AGTCGAAGCC 


GGCTACGGGA 


CACAGGACGT 


TTCCGGCGTT 


TACCTGAAAC 


851 


TGGCAGAACA 


CTGA 









This corresponds to the amino acid sequence <SEQ ID 366; ORF 105>; 

ml 05 .pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGX IIVNMSTISP 

101 TEKLAVKALV EAQRQFAEAP VSGSVGPATN GTLLILFGGS EPFXTRCKKY 

151 FPSSAKKPSI SAMSAKVRAR NSSXTRSWAF SANVQRXXLM ARQFGIDTDT 

201 IVEAIGDSAM DSPMFQTKKS LWANREFPXX FALKHASKDL NLAVKELEQA 

251 GNTLPAVETV AASYRKAVEA GYGTQDVSGV YLKLAEH 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF firom A^, gonorrhoeae 

ORF 105 shows 79.9% identity over a 289 aa overlap v^th a predicted ORF (ORF 105.ng) 
firom A^. gonorrhoeae: 
ml05/gl05 



10 20 30 40 50 60 

gl05 , pep MSAETYTQIGWVGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGSTAELV 

ill: hlll|:|llillllllllllllllllllllllllllllll|llllll|:||||| 
ml05 MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
10 20 30 40 50 60 



70 80 90 100 110 120 

gl05 . pep RACPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 

I IIIIIIIIIIIIIIIIIIIIIIIMI lllllllllllhlllllllll IIIM 

ml05 RDYPVI FLMVSDYAAVCDILNGVRDGLAGXI IVNMSTISPTEKLAVKALVEAQR-QFAEA 

70 80 90 100 110 



130 140 150 160 170 180 

gl05 . pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

IIIIIIIIIIIIIIIIIIMII : :| I :|| :: :: |: 
ml05 PVSGSVGPATNGTLLILFGGSEPFXTRCKKYFPSSAKKP-SISAMSAKVRARNSSXTRSW 
120 130 140 150 160 170 
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190 200 210 220 230 240 

g 1 0 5 . pep I FGEAYSEAMLMARQFG IDTDTI VEAI GGS AMDS PMFQTKKSLWANREFP PAFALKHAS K 

h IIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIII II INI I I 

ml05 AFSANVQRXXLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPXXFALKHASK 
180 190 200 210 220 230 



250 260 270 280 289 

gl05 . pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEH 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI lllllllllllll 

ml 0 5 DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAGYGTQDVSGVYLKLAEH 

240 250 260 270 280 



The following partial DNA sequence was identified in M meningitidis <SEQ ID 367>: 

al05 . seq 

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAMCTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT .TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CT^GCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 368; ORF 105.a>: 

alOS.pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

ml05/al05 96.5% identity in 289 aa overlap 



10 20 30 40 50 60 

mlOS.pep MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 

iiiiiiuiiiiiiiiiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

al05 MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
10 20 30 40 50 60 

70 80 90 100 110 119 

ml05 . pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAG-QFAEA 

I I I i I It I I I I I I I I I I I M I I I I I I I I I I I I I ] ) Mill 

al05 RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 
70 80 90 100 110 120 

120 130 140 150 160 170 179 

mlOS.pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
M I i I I (I M I M I I I M I I 1 I I I I I I t I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I 
al05 PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
130 140 150 160 170 180 



180 



190 200 210 220 230 
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mlOS . pep IFGDV-QRXML^4ARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPXAFALKHASK 
III:: : : II M I I I I I I I I II I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I 
al05 IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 
190 200 210 220 230 240 



240 250 260 270 280 

ml 05 . pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I II II i I M I I 
al05 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
250 260 270 280 290 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 369>: 

gl05-l .seq 

1 ATGTCCGCAG AAACATACAC ACAAATCGGC TGGGTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGAGCA 

151 AAAGTTTACG GCAGCACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 

651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 

701 CACTCAAACA CGCTTCCAAA GACCTTAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTT GAAGCCGGCT ACGGCGZ^CA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 370; ORF 105-l.ng>: 

gl 05-1. pop 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RACP VIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



The following partial DNA sequence was identified in A'^ meningitidis <SEQ ID 371>: 

mlOS-l.seq 

1 ATGTCGGCAA ACGAATACGC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCcG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAnCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGsGACTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCG CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAACTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 372; ORF 105-1>: 

ml05-l.pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 
51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 
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101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEXM LMARQFGIDT 

201 DTIVEAIGDS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



ml05-l/gl05-l 96.9% identity in 289 aa overlap 

10 20 30 40 50 60 

ml 0 5 - 1 . pep MSANEYAQIGWI GLGQMGLPMVTRLLDGGI EVGV YNRS PDKTAPI S AKGAKV YGNTAELV 
III: |:||||:|tllllllllllllllllllllllltllllMIIIIIIIIII:lllll 
gl05-l MSAETYTQIGWVGLGQMGLPMVTRLLDGGIEVGVYNRS PDKTAPI SAKGAKVYGSTAELV 

10 20 30 40 50 60 



70 80 90 100 110 120 

inlOS-l .pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 
I I I I I I I I II I t I I I 1 I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I II II I II 
g 1 0 5 - 1 RACPVI FLMVS DYAAVCDI LNG VRDGLAGKI I VNMSTI S PTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml05-l.pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
II II II IN I II M I I I I I II 1 II I II II II I I I I I I I I I I I I II t II I M I I II II I I I 
gl06-l PVSGSVGPATNGTLLILFGGSEAVLNPLQKXFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 05-1. pep I FGEAYSEXMLMARQFG I DTDT I VEAIGDSAMDSPMFQTKKSLWANREFP PAFALKHASK 
I I I t I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I t I I I I I II I I I 
gl05-l IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 



250 260 270 280 290 

ral05-l .pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I 1 I I I I I I I I I [ I I I I I I I I I I I I I I I I I 1 II I I I 1 I I I I I I I I I I I I I I 
gl05 - 1 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 373>: 

al05-l.seq 

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 374; ORF 105-1. a>: 

al05-l.pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV 5DYAAVCDIL NGVRDGLAGK I I VNMSTI SP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

alOS-l/mlOS-l 99.0% identity in 289 aa overlap 

10 20 30 40 50 60 

al 05-1. pep MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
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ml 05-1 MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
10 20 30 40 50 60 

70 80 90 100 110 120 

a 1 05 - 1 . pep RDY PVI FLMVSDYAAVCDILNGVRDGLAGKI I VNMSTI S PTENLAVKALVEAAGGQFAEA 

1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 n I I I I I I I I I I I I I M I I I I I I 1 I I I I I I I I I I I 

ml 0 5 - 1 RDYPVI FLMVSDYAAVCDI LNGVRDGLAGKI I VNMST I S PTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

al05-l .pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
I I I I ) I I I I I I t I I I I I I I t I I I I I I I I I I ) I I I I I I I I I I i i I t I I I I I I I 1 I I I I I I I 
ml 05-1 PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 

190 200 210 220 230 240 

al 05-1. pep IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 
I I I I I I I I I I I I I I I I I t I I I I I I I I i 1 I I i I i I I I I ) I I I I I I t I I I I I I I I I I I I I 
ml 05-1 IFGEAYSEXMLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 

250 260 270 280 290 

a 1 0 5 - 1 . pep DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAG YGEQDVSGV YLKLAEHX 
I I I I I I ) I I I I I I I t I I I I I I I I I I I I i I I I I I I I I I I I I I I I 1 I I I I I I 
ml 05-1 DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 290 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 375>: 

gl07 . seq 

1 ATGGTATTAA CCTTTATTTG GGCAACCGGC GGCCTGGTTG CCGATGCCAA 

51 ACCGTCCGTC CTCTTCGACA CCCAAGCCCC CGTCGGCACC GGCTACTGGA 

101 TTTACGCCGC CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTTT CCAGCCTGCT CAAATACTTT AAAGGCGACG cgcCcaaagt 

201 GgCGAaATCc atctggGcag gtacattggT TGCCttggta atttacgtcc 

251 TCTggcaaac cgccatCcaa agcaaccTGC cgcgcaacga gttcgcCCCc 

301 gtgattgccg ccgagaggca actCTCCGTC CTgaatgaaa cccTGtccaa 

351 attcgcccaa accggcgata tggataAaat attgtcccta tttccctaca 

401 tggcaatcgc cacctccttt ttaggcgTAA Ccttaggcct gtttgacaac 

451 atcgccggac atcttcaaat ggaacgacag tatgtccggg cggcaccaaa 

501 accgtcgcgc tga 

This corresponds to the amino acid sequence <SEQ ID 376; ORF 107.ng>: 

gl07 .pep 

1 MVLTFIWATG GLVADA KPSV LFDTQAPVGT GYWIYAATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IW AGTLVALV lYVLWQTAI Q SNLPRNEFAP 

101 VIAAERQLSV LNETLSKFAQ TGDMD KIL5L FPYMAIATSF LGVT LGLFDN 

151 lAGHLQMERQ YVRAAPKPSR * 

The following partial DNA sequence was identified in meningitidis <SEQ ID 377>: 

ml07 . seq 

1 ATGGTATTGA CCTTTATTTG GGCGGCCGGC GGGCTGATTG CCGATGCCAA 

51 GCCGTCCGTC CTCTTCGATA CCCAAGCCCC CGCCGGCACA AACTACTGGA 

101 TTTACGCCGS CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTCT CCAGCCTGCT CAAATACTTT AAAGGCGACG CGCCCAAAGT 

201 GGCTAAATCC ATCTGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 

251 TCTGGCAAAC CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 

301 GTCATCGCCG CCGAAGGGCA AGTCTCCGTC CTCATCGAAA CCCTGTCCAA 

351 ATTCGCCCAA ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 

401 TGGCGATCGC CACCTCGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 

451 ATCGCCCATC TTCAAATGGA ACGACAGCAT CTCCGGgCCG CACCAAAACC 

501 GCCGCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 378; ORF 107>: 

ml07 .pep. . 

1 MVLTFIWAAG GLIADA KPSV LFDTQAPAGT NYWIYAXTAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IW TGTLIALV lYVLWOTAI O GNLPRNEFAP 
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101 VIAAEGQVSV LIETLSKFAQ TGNMD KILSL FSYMAIATSF LGV TLGLFDY 
151 lAHLQMERQH LRAAPKPPR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. ^gonorrhoeae 

ORF 107 shows 89.4% identity over a 170 aa overlap with a predicted ORF (ORF 107.ng) 
from A^. gonorrhoeae: 

ml07/gl07 

10 20 30 40 50 60 

MVLTFIWAAGGLIADAKPSVLFDTQAPAGTNYWIYAXTALPVCLASFGFHGNVSSLLKYF 

IlillllhllhlllllllllllMhIhIIIII lllllllllllllllllilllll 

MVLTFIWATGGLVADAKPSVLFDTQAPVGTGYWIYAATALPVCLASFGFHGNVSSLLKYF 
10 2Q 30 . 40 50 60 

70 80 90 100 110 120 

KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 
IIIIIMIIII|:|lhllllllllllllhllllllllllllll hill llllllll 
KGDAPKVAKSIWAGTLVALVIYVLWQTAIQSNLPRNEFAPVIAAERQLSVLNETLSKFAQ 
70 80 90 100 110 120 

130 140 150 160 170 

TGNMDKILSLFSYMAIATSFLGVTLGLFDYIA-HLQMERQHLRAAPKPPR 

Ihllllllll lllllllllllllllll II llllllhHIIIII I 

TGDMDKILSLFPYMAIATSFLGVTLGLFDNIAGHLQMERQYVRAAPKPSR 
130 140 150 160 170 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 379>: 

al07 . seq 

1 ATGGTATTAA CCTTTATTTG GGCAACCGGC GGCCTGATTG CCGATGCCAA 

51 ACTGCCCGTC CTCTTCGACA CCCAAGCCCC TACCGGCACC AACTACTGGA 

101 TTTATGTCGC CACCGCCCTG CCCGTCTGCC TTGCGTCATT CGGTTTCCAC 

151 GGCAACGTCT CCAGCCTGCT CAAATACTTT AAAGGCGACG CGCCCAAAGT 

201 GGCTAAATCC ATCTGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 

251 TCTGGCAAAC CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 

301 GTGATTGCCG CCGAAGGGCA AGTCTCCGTC CTGATTGAAA CCCTGTCCAA 

351 ATTCGCCCAA ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 

401 TGGCGATCGC CACCTCGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 

4 51 ATCGCCGACA TCTTCAAATG GAACGACAGC GTGTCCGGCC GCACCAAAAC 

501 CGCCGCGCTG ACCTTCCTGC CGCCTCTAAT TTCCTGCCTG CTCTTCCCCA 

551 CCGGCTTTGT TACCGCCATC GGCTACGTCG GCCTGGCGGC AACCGTCTGG 

601 ACAGGCATCA TCCCCGCCAT GCTGCTCTAC CGTTCGCGCA AAAAATTCGG 

651 CGCAGGCAAA ACCTATAAAG TTTACGGCGG CTTGTGGCTG ATGGTTTGGG 

701 TCTTCCTTTT CGGCATCGTC AACATCGCCG CACAGGTATT GAGCCAAATG 

751 GAACTCGTCC CCGTATTTAA AGGATAA 

This corresponds to the amino acid sequence <SEQ ID 380; ORF 107.a>: 

alOV.pep 

1 MVLTFIWATG GLIADTiKLPV LFDTQAPTGT NYWIYVATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IWTGTLIALV lYVLWQTAIQ GNLPRNEFAP 

101 VIT^GQVSV LIETLSKFAQ TGNMDKILSL FSYMAIATSF LGVTLGLFDY 

151 lADIFKWNDS VSGRTKTAAL TFLPPLISCL LFPTGFVTAI GYVGLAATVW 

201 TGIIPAMLLY RSRKKFGAGK TYKVYGGLWL MVWVFLFGIV NIAAQVLSQM 

251 ELVPVFKG* 



ml07 .pep 
gl07 

ml07 .pep 
gl07 

ml07 .pep 
gl07 



ml07/al07 94.8% identity in 154 aa overlap 

10 20 30 40 50 60 

ml 07 .pep MVLTFIWAAGGLIADAKPSVLFDTQAPAGTNYWIYAXTALPVCLASFGFHGNVSSLLKYF 
I I I M I I I : I I I I I I I I 11111111:1111111: I I I I I I t I I I I I I M I I I I M f I 
al07 MVLTFIWATGGLIADAKLPVLFDTQAPTGTNYWIYVATALPVCLASFGFHGNVSSLLKYF 

10 20 30 40 50 60 
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70 80 90 100 110 120 

ml07 . pep KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 
I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I t I I I I I I I I I I I It I I I I I I I M I I I 
al07 KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 

70 80 90 100 110 120 

130 140 150 160 170 

ml 07 .pep TGNMDKILSLFSYMAIATSFLGVTLGLFDYIAHLQMERQHLRAAPKPPRX 

I i I I j I I I I I I I I I I I I I I I I I I I I I I I M I I : 
al07 TGNMDKILSLFSYMAIATSFLGVTLGLFDYIADIFKWNDSVSGRTKTAALTFLPPLISCL 

130 140 150 160 170 180 

al07 LFPTGFVTAIGYVGL7UVTVWTGIIPAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIV 
190 200 210 220 230 240 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 381>: 

gl08 . seq 

1 ATGttgccgg gCTTCAACCG GATATTCAaa cggTTTGCTC CAACACTCGG 

51 AAcggCGCAT AAAACGCCgc CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATCCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGCG GCAATCTTAC TCGGCGGGTG 

201 CGCCGCCGGC GGCAACACAT TCGGCAGCTT AGACGGCGGC ACGGGTATGG 

251 GTGGCAGCAT CGTCAAAATG ACGGTAGAAA gccAATGCCXS TGCGGAATTG 

301 GACAGGCGCA GCGAATGGCG TTTGACCGCG CTGGCGATGA GTGCCGAAAA 

351 ACAGGCGGAA TGGGAAAACA AGATTTGCGG CTGCGCTACC GAAGAAGCAC 

401 CTAACCAGCT GACCGGCAAC GATGTGATGC AGATGCTGAa CCagtccacG 

451 CGCaatcagg cacTtgccgc CCtgaccgTC AAAacggtTT CcgcctgcTT 

501 CAaacgcctg tACCGCTAa 

This corresponds to the amino acid sequence <SEQ ID 382; ORF 108.ng>: 

gl08 .pep 

1 MLPGFNRIFK RFAPTLGTAH KTPPFALSRT GRLIRSYRHK RRGFNRKGIE 

51 MNKT LSILPA AILLGGCAAG GN TFGSLDGG TGMGGSIVKM TVESQCRAEL 

101 DRRSEWRLTA LAMSAEKQAE WENKICGCAT EEAPNQLTGN DVMQMLNQST 

151 RNQALAALTV KTVSACFKRL YR* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 383>: 

ml08 . seq 

1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATG7\ATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGA GGCGGTAACA CATTCGGCAG CTTAGACGGT GGCACAGGCA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTTG GGAGCCAATG CCGTGCGGAA 

301 TTGGACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGGCG GAGTGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

401 CACCCGAACG GATGACCGGC AACGATGTGA TGCAGATGCT GGCTCCGTCC 

451 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 

501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 384; ORF 108>: 

mlOB .pep 

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNK TLSILPV AILLGGCAAG G GNTFGSLDG GTGMGGSIVK MAVGSQCRAE 

101 LDKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPERMTG NDVMQMLAPS 

151 TRNQALAALT AKTVSACFKH LYR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 108 shows 89.6% identity over a 173 aa overlap with a predicted ORF (ORF lOS.ng) 
from A^. gonorrhoeae: 

ml08/gl08 
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10 20 30 40 50 60 

ml08 . pep MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 

illllllllllhllllllllllllllllllllll IMIIIIIIIIIIIIIIIIIIIh 

g 1 0 8 MLPGFNRI FKRFAPTLGTAHKT PPFALS RTGRLI RS YRHKRRGFNRKGI EMNKTLS I LPA 

10 20 30 40 50 60 

70 80 90 100 110 120 

mlO 8 . pep AILLGGCAAGGGNTFGSLDGGTGMGGS IVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA 

IIIIIIIMII llllllllllllllllllhl llillllhlllllllllllllllll 

gl08 AILLGGCAAGG-NTFGSLDGGTGMGGSIVKMTVESQCRAELDRRSEWRLTALAMSAEKQA 
70 80 90 100 110 

130 140 150 160 170 

ralO 8 . pep EWENKICACVAQEAPERMTGNDVMQMLAPSTRNQALAALTAKTVSACFKHLYRX 

|||||||:|:::||h::|||||||ll I I 1 I I I I I I I I : I I I I I I I I : I I I 
g 1 0 8 EWENKI CGCATEEAPNQLTGNDVMQMLNQSTRNQALAALTVKTVS ACFKRLYRX 

120 130 140 150 160 170 

The following partial DNA sequence was identified in meningitidis <SEQ ID 385>: 

alOS.seq 

1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATA/^ AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGG GGCGGTAACA CATTCGGCAG CTTAGACGGC GGCACAGGTA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTAG AAAGCCAATG CCGTGCGGAA 

301 TTGAACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGGCG GAATGGGAAA ACAAGATTTG CGCTTGCGTC GCCCT^GAAG 

4 01 CACCCAACCA GCTGACCGGC AACGATGTGA TGCAGATGCT GGATCCGTCC 

4 51 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 

501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 386; ORF 108.a>: 
alOS.pep 

1 MLPGFNRI FK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 
51 MNKT LSILPV AILLGGCAAG G GNTFGSLDG GTGMGGSIVK MAVESQCRAE 
101 LNKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPNQLTG NDVMQMLDPS 
151 TRNQALAALT AKTVSACFKH LYR* 



ml08/al08 



96.5% identity in 173 aa overlap 



10 20 30 40 50 60 

mlOB.pep MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 
M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I 
al08 MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 08 . pep AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA 
t ) I I I I I I I I M I I I I i n I I I I t I I I I I I I I I I M I I I I : I I I I I I I I I I I I I I I I I I 
al08 AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVESQCRAELNKRSEWRLTALAMSAEKQA 

70 80 90 100 110 120 



130 140 150 160 170 

ml08 . pep EWENKICACVAQEAPERMTGNDVMQMLAPSTRNQALAALTAKTVSACFKHLYRX 
I It I I I i I I I I I I I I ::: I I I I I I I I I I I I I I I I I I t I I I I I I I t I M I I I I I 
al08 EWENKICACVAQEAPNQLTGNDVMQMLDPSTRNQALAALTAKTVSACFKHLYRX 

130 140 150 160 170 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 387>: 
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gl09 . seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GATCGTAGGC GTATGCTTAC CGCTTTTGGA AGCGGGCATG 

101 GAAATGACGC GCAAAGGCAA AACCACCCAA TCCGCCGCCA TCGTGGTGTT 

151 CTCTTCCGTC TGGTCAATCC GGTTTTCGGC TGGGCGTTGA CGATGCTGTT 

201 GGATAATTTG GGCTTAATCG GCTGCAAAGA ACGCAGCGCG CAATTAGGTT 

251 TTGTCGGACG AGTATTGATA CCCGCAGTAG GTTTCTTAAT CTTGTGTGTG 

301 GCGATGGGTG CGGTCGGGAT GCTGCCCGGT ATCCCTCCGT TTTTGGAGCA 

351 GTTCAAATCT TTGGGCTAG 

This corresponds to the amino acid sequence <SEQ ID 388; ORF 109.ng>: 

gl09 .pep 

1 MYYRRWGLS DGLGDLAAGI DRRRMLTAFG SGHGNDAQRQ NHPIRRHRGV 

51 LFRLVNPVFG WALTMLLDNL GLIGCKERSA QLGFVG RVLI PAVGFLILCV 

101 AMGAVGMLPG IPPFLEQFKS LG 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 389>: 

ml09 . seq 

1 ATGTATTATC GCCGGGTTAT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CATCGTGGTG TTCTCTTCCG CCTTGTCAAT CCGGTTTTCG GCTGGGCGTT 

201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGTG 

251 CGCAATTAGG TTTCGCCGGA CGCGTGTTGA TACCCGCAGT AGGTTTCTTG 

301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAA CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 109>: 

ml09 .pep 

1 MYYRRVMGIiS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPIRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFAG R VLIPAVGFL 

101 XLCVAMGAV G MLPGIPPFLE HFKSLG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 109 shows 92.9% identity over a 126 aa overlap with a predicted ORF (ORF 109.ng) 

from A^. gonorrhoeae: 
ml09/gl09 

10 20 30 40 50 60 

MYYRRVMGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

Illllhllllllllllllhl IhlllllllllMIIIIIIIIIMIIIIIIIII 

MYYRRWGLSDGLGDLAAGIDR RRMLTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

10 20 30 40 50 

70 80 90 100 110 120 

PVFGWALTMLLDNLGLIGCKERSAQLGFAGRVLI PAVGFLILCVAMGAVGMLPGI PPFLE 

MIIIIIIIIIIMIIIIIIMIMIIhlllllllllllilllllllllllllllllll 

PVFGWALTMLLDNLGLIGCKERSAQLGFVGRVLIPAVGFLILCVAMGAVGMLPGIPPFLE 
60 70 80 90 100 110 



'ml09 .pep 

gio9 

ml09 .pep 
gl09 



ml 09. pep HFKSLGX 

:MIII 
gl09 QFKSLGX 
120 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 391>: 

al09. seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AA7VACCACCC AATCCGCCGC 

151 CACCGTGGTG TTCTCTTCCG CTTGGTCMT CCGGTTTTCG GCTGGGCGTT 
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201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGCG 

251 CGCAATTAGG TTTCACCGGA CGCGTATTGA TACCCGTAGT AGGTTTCTTG 

301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAG CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 392; ORF 109>: 
al09.pep 

1 MYYRRVVGLS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPIRR 
51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFTG R VLIPWGFL 
101 ILCVAMGA VG MLPGIPPFLE HFKSLG* 

ml09/al09 97.6% identity in 126 aa overlap 

10 20 30 40 50 60 

itll09.pep MYYRRVMGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 
I I 11 I I : I i I I I I I I I I I M I i I I I I I I 1 I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
al09 MYYRRWGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

10 20 30 40 50 60 

70 80 90 100 110 120 

inl09.pep PVFGWALTMLLDNLGLIGCKERSAQLGFAGRVLIPAVGFLILCVAMGAVGMLPGIPPFLE 
I I M I t I I i I I I I I I i I I I I I I I I I I I I : I I I I i I : I I I I I t I I I I I I I I I M I I I I I I I 
al 0 9 PVFGWALTMLLDNLGLIGCKERSAQLGFTGRVLI PVVGFLILCVAMGAVGMLPGI PPFLE 

70 80 90 100 110 120 



ml 09. pep HFKSLGX 
I M I I I ! 

al09 HFKSLGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 393>: 

gill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGaacaaacC GCGCAaaccg 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCtATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 

201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGtCCaCC TACCAGACCG 

251 ATTCCGAAAT CAGCCGGTTt atacagacan atgctggaga gctcttcgcg 

301 tntcatgcag nttctataac tgattccgcc gaagactgtc tgcctaatac 

351 gcctatctca tcggcgctct ga 

This corresponds to the amino acid sequence <SEQ ID 394; ORF 1 1 Lng>: 

gill. pep 

1 MPSETRLPNL IRALIFALGF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF IQTAGELFAH 

101 ASITDSAEDC LPNTPISSAL * 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 395>: 

mill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATAyCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AwAAACGCAT 

201 CGATGACGCG CTTAAAGAAk TCAACCGGyA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

451 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
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601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 

701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGcTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 396; ORF 1 1 1>: 

mill .pep 

1 MPSETRLPNF IRVLIFALGF IFLMA CSEQT AQTVTLQGET MGTTYXVKYL 
51 SNNRDKLPSP AEIXKRIDDA LKEXNRXMST YQPDSEISRF NQHTAGKPLR 
101 ISSDF/^TA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 
151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 
201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 
251 NNRSLATSGD YRIFHVDKNG KRLSHIINPKf NKRPISHNLA SISWADSAM 
301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 
351 R* 

ORF 1 1 1 shows 88.7% identity over a 97 aa overlap with a predicted ORF (ORF 1 1 1 .ng) 
from N. gonorrhoeae: 

mill .pep/glll .pep 

10 20 30 40 50 60 

mill .pep MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSP 

llllllllhlhllllllllllllllllllllllllillllllhlllllllllMIII 

gill MPSETRLPNLIRALIFALGFIFl^ACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
10 20 30 40 50 60 

70 80 90 100 110 120 

mill . pep AEIXKRIDDALKEXNRXMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

hi lllilllll II Illll lllllll I :||: 
gill AKIQKRIDDALKEVNRQMSTYQTDSEISRFIQTXAGELFAXHAXSITDSAEDCLPNTPIS 

70 80 90 100 110 120 

130 140 150 160 170 180 

mill . pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 

gill SALX 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 397>: 

alll.seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAAG CAGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG 

701 AACAGCCCAA CATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 
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801 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 398; ORF 1 ll.a>: 
alll .pep 

1 MPSETRLPNF IRTLIFALSF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKSG KRLSHIINPN NKRPISHNLA SISWADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 

mlll/alll 97.7% identity in 351 aa overlap 

10 20 30 40 50 60 

mill, pep MPSETRLPNFIRVLI FALG FI FLNACSEQTAQTVTLQGETMGTT YXVKYLSNNRDKLPS P 

I I I I I I I I I I I I : I I I I I : I I 1 I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I i I t I I 
alll MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 . 50 60 

70 80 90 100 110 120 

mill . pep AEIXKRIDDALKEXNRXMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
III I It I II I II II I I II I II I I I I I I I II I I I I I I I I I I I I I I I I I I I i : 1 I I I I I 
alll AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

mill . pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
I I I I I II I I I I I I I I I M I I I I I I I I II I I I I I I I I I i I I I II I I I I I I I I I I I I M I I I 
alll GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

mill . pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I I M I II I I I I II M I I I I I I I II I I I I I I I I I I I I I I I I I I I II II I I I I II I I I I I I 
alll AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 

190 200 210 220 230 240 

250 260 270 280 . 290 300 

mill, pep GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISWADSAM 
I I I I I I t I I I II I I I I I I I I II I I I I I I : I I M I I I I I I I I I t I II I It I I I I I I I I I I I 
alll GGNTQIIVPLNNRSLATSGDYRIFHVDKSGKRLSHIINPNNKRPISHNLASISWADSAM 

250 260 270 280 290 300 

310 320 330 340 350 

mill . pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
I I I I I I I I I I I I I I I I I I I I i I I I I t I I I I I I I I I I It I I I i I I II I I t I I I 
alll TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 

310 320 330 340 350 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ED 399>: 

glll-l.seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 
51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAacCG 
101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCTATACCGT CAAATACCTT 
151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 
201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TACCAGACCG 
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251 ATTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ATTTCGCACA CGTTACCGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACTCAC GGCGCACTGG ACGTAACCGT CGGCCCTTTG GTCAACCTTT 

401 GGGGGTTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

451 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGCAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAA GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCggcGAGTT 

651 GCACGGCAAA GGCAAAAATG CGCACGGCGA ACCGTGGCGC ATCGGTATAG 

701 AGCAACCCAA TATcatccaa ggcggcaata cgcAGattat cgtcccgctg 

751 .aaCaaccgtt cgcttgccac ttccggcgAT taccgtaTTT tccacgtcgA 

801 TAAAAACGGC Aaacgccttt cccacATCAT CAATCCCAAC AACAAACGAC 

851 CCATCAGcCA CAAcctcgcc tcCATCAgCg TGGTCTCAGA CAGTGCAATG 

901 ACGGCGGACG GTTTATCCAC AGGATTATTT GTTTTAGGCG AAACCGAAGC 

951 CTTAAGGCTG GCAGAACAAG AAAAACTCGC TGTTTTCCTA ATTGTCCGGG 

1001 ATAAGGACGG CTACCGCACC GCCATGTCTT CCGAATTTGC CAAGCTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 400; ORF 1 1 1-1, ng>: 

glll-l.pep 

1 MPSETRLPNL IRALIFALGF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILQQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNAHGEPWR IGIEQPNIIQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVSDSAM 

301 TADGLSTGLF VLGETEALRL AEQEKLAVFL IVRDKDGYRT AMSSEFAKLL 

351 R* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 401>: 

mlll-l. seq 

1 ATGCCGTCTG AAACACGCCT 

51 CCTGGGTTTC ATCTTCCTGA 

101 TTACCCTGCA AGGCGAAACG 

151 TCAAATAATC GGGACAAACT 

201 CGATGACGCG CTTAAAGAAG 

251 ACTCCGAAAT CAGCCGGTTC 

301 ATTTCAAGCG ACTTCGCACA 

351 CCTGACACAC GGCGCGCTGG 

401 GGGGATTCGG CCCCGACAAA 

451 ATCAAACAGG CGGCATCTTA 

501 AGGCAAAGAT TACGCTTCCT 

551 ATTTATCTTC GATTGCCAAA 

601 CTGGAAAAAT ACGGCATTCA 

651 GCACGGCAAA GGCAAAAACG 

701 AGCAGCCCAA TATCGTCCAA 

751 AACAACCGTT CGCTTGCCAC 

801 TAAAAACGGC AAACGCCTCT 

851 CCATCAGCCA CAACCTCGCC 

901 ACGGCGGACG GCTTGTCCAC 

951 CTTAAAGCTG GCAGAGCGCG 

1001 ATAAAGGCGG CTACCGCACC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 402; ORF 1 1 1-1>: 

mlll-l.pep 

1 MPSETRLPNF IRVLIFALGF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISWADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 

mlll-l/glll-1 96.6% identity in 351 aa overlap 

10 20 30 40 50 60 

ml 11-1. pep MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
IIIIMIII:||:IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII||||||||||| 
glll-1 MPSETRLPNLIRALIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 



GCCGAACTTT ATCCGCGTCT TGATATTTGC 
ACGCCTGTTC GGAACAAACC GCGCAAACCG 
ATGGGCACGA CCTATACCGT CAAATACCTT 
CCCCTCACCT GCCGAAATAC AAAAACGCAT 
TCAACCGGCA GATGTCCACC TATCAGCCCG 
AACCAACACA CAGCCGGCAA GCCCCTCCGC 
CGTTACTGCC GAAGCCGTCC GCCTGAACCG 
ACGTAACCGT CGGCCCCTTG GTCAACCTTT 
TCCGTTACCC GTGAACCGTC GCCGGAACAA 
TACGGGCATA GACAAAATCA TTTTGAAACA 
TGAGCAAAAC CCACCCCAAG GCCTATTTGG 
GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
AAATTATCTG GTCGAAATCG GCGGCGAGTT 
CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 
GGCGGCAATA CGCAGATTAT CGTCCCGCTG 
TTCCGGCGAT TACCGTATTT TCCACGTCGA 
CCCATATCAT CAACCCGAAC AACAAACGAC 
TCCATCAGCG TGGTCGCAGA CAGTGCGATG 
AGGATTATTC GTATTGGGCG AAACCGAAGC 
AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 
GCCATGTCTT CCGAATTTGA AAAACTGCTC 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ml 11-1. pep AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
1 : M I I I M I I I I I I I I i I I I I lilllllllllllllllllinilltlllllllltii 
glll-1 AKIQKRIDDALKEVNRQMSTYQTDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

mlll-l.pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
lll)|||IMIIIIIIItIIIIIII)Mllllllillllllllll:lll)llllllllll 
glll-1 GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAAS YTGI DKIILQQGKDYASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 11-1 .pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I I I I ) I I I I I I I I I I I I I I I I I I I I I t I- 1 I I I I I I I I I I t i I 1 : I I I I I i I I t I I I I : I 
glll-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNAHGEPWRIGIEQPNIIQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

mlll-l.pep GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAM 
I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I i I I t I I I I I M I I I I I I I I I I I I I I I I : I I I I 
glll-1 GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVSDSAM 

250 260 270 280 290 300 

310 320 330 340 350 

mil 1-1 .pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
lllllllllll)llllll:|||:|||||||||||l llllllllll Mill 
glll-1 TADGLSTGLFVLGETEALRLAEQEKLAVFLIVRDKDGYRTAMSSEFAKLLRX 

310 320 330 340 350 

glll-l/p44550 

sp|P4 4550iYOJL_HAEIN HYPOTHETICAL LIPOPROTEIN HI0172 PRECURSOR >gi | 1074292 1 pir | | C64144 
hypothetical protein HI0172 - Haemophilus influenzae (strain Rd KW20) >gi 11573128 <U32702) 
lipoprotein, putative {Haemophilus influenzae Rd] Length - 346 
Score = 349 bits (885), Expect « 2e-95 

Identities - 177/328 (53%), Positives = 240/328 (72%), Gaps = 4/328 (1%) 



Query: 


23 


LNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSPAEIXKRIDDALKEXNRXMSTYQ 


82 






L AC ++T + ++L G+TMGTTY VKYL + S + + 1+ LK+ N MSTY+ 




Sbjct: 


17 


LAACQKET-KVISLSGKTMGTTYHVKYLDDGSITATS-EKTHEEIEAILKDVNAKMSTYK 


74 


Query: 


83 


PDSEISRFHQHT-AGKPLRISSDFAHVTAEAVRLNRLTHGALDVTVGPLVNLWGFGPDKS 


141 






DSE+SRFNQ+T P+ IS+DFA V AEA+RLN++T GALDVTVGP+VNLWGFGP+K 




Sbjct: 


75 


KDSELSRFNQNTQVNTPIEISADFAKVLAEAIRLNKVTEGALDVTVGPVVNLWGFGPEKR 


134 


Query: 


142 


VTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPKAYLDLSSIAKGFGVDKVAGEL 


201 






++P+PEQ+ + ++ GIDKI L K+ A+LSK P+ Y+DLSSIAKGFGVD+VA +L 




Sbjct: 


135 


PEKQPTPEQLAERQAWVGIDKITLDTNKEKATLSKALPQVYVDLSSIAKGFGVDQVAEKL 


194 


Query: 


202 


EKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQGGNTQIIVPLNNRSLATSGDY 


261 






E+ QNY+VEIGGE+ KGKN G+PW+I lE+P + ++ LNN +A+SGDY 




Sbjct: 


195 


EQLNAQNYMVEIGGEIRAKGKNIEGKPWQIAIEKPTTTGERAVEAVIGLNNMGMASSGDY 


254 


Query: 


262 


RIFHVDKNGKRLSHIINPNNKRPISHNLASISWADSAMTADGLSTGLFVLGETEALKLA 


321 






RI+ ++NGKR +H I+P PI H+LASI+V+A ++MTADGLSTGLFVLGE +AL++A 




Sbjct: 


255 


RIY-FEENGKRFAHEIDPKTGYPIQHHLASITVLAPTSMTADGLSTGLFVLGEDKALEVA 


313 


Query: 


322 


EREKLAVFLIVRDKGGYRTAMSSEFEKL 349 








E+ LAV+LI+R G+ T SS F+KL 




Sbjct: 


314 


EKNNLAVYLIIRTDNGFVTKSSSAFKKL 341 





The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 403>: 

alll-l.seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 
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351 


CCTGACACAC 


401 


GGGGATTCGG 


451 


ATCAAACAAG 


501 


AGGCAAAGAT 


551 


ATTTATCTTC 


601 


CTGGAAAAAT 


651 


GCACGGCAAA 


701 


AACAGCCCAA 


751 


AACAACCGTT 


801 


TAAAAGCGGC 


851 


CCATCAGCCA 


901 


ACGGCGGACG 


951 


CTTAAAGCTG 


1001 


ATAAAGGCGG 


1051 


CGCTAA 



GGCGCGCTGG 
CCCCGACAAA 
CAGCATCTTA 
TACGCTTCCT 
GATTGCCAAA 
ACGGCATTCA 
GGCAAAAACG 
CATCGTCCAA 
CGCTTGCCAC 
AAACGCCTCT 
CAACCTCGCC 
GCTTGTCCAC 
GCAGAGCGCG 
CTACCGCACC 



ACGTAACCGT 
TCCGTTACCC 
TACGGGCATA 
TGAGCAAAAC 
GGCTTCGGCG 
AAATTATCTG 
CGCGCGGCGA 
GGCGGCAATA 
TTCCGGCGAT 
CCCATATCAT 
TCCATCAGCG 
AGGATTATTC 
AAAAACTCGC 
GCCATGTCTT 



CGGCCCCTTG 
GTGAACCGTC 
GACAAAATCA 
CCACCCCAAG 
TTGATAAAGT 
GTCGAAATCG 
ACCTTGGCGC 
CGCAGATTAT 
TACCGTATTT 
TAATCCGAAC 
TGGTCGCAGA 
GTATTGGGCG 
TGTTTTCCTG 
CCGAATTTGA 



GTCAACCTTT 
GCCGGAACAA 
TTTTGAAACA 
GCCTATTTGG 
TGCGGGCGAA 
GCGGCGAGTT 
ATCGGCATCG 
CGTCCCGCTG 
TCCACGTCGA 
AACAAACGAC 
CAGTGCGATG 
AAACCGAAGC 
ATTGTCAGGG 
AAAACTGCTC 



This corresponds to the amino acid sequence <SEQ ID 404; ORF 1 1 l-l.a>; 

alll-l .pep 

1 MPSETRLPNF IRTLIFALSF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKSG KRLSHIINPN NKRPISHNLA SISWADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 



alll-l/mlll-1 98.9% identity in 351 aa overlap 

10 20 30 40 50 60 

alll-l. pep MPSETRLPNFIRTLI FALS FI FLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
llllllllllll:|llll:|||)IIIIIIIIIMIIIItlllllllllllll)ll|||l| 
ml 11-1 MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 



70 80 90 100 110 120 

alll-l. pep AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 
I I I ) I i I I I I I I t I I I I I I I I I t I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I : M I I I I 
mlll-1 AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

70 80 90 100 110 120 



130 140 150 160 170 180 

alll-l. pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
I t I t I i I I I i I I I I I I i I I I I I 1 I I I I I I I I I M I I I I I I I I I I I 1 I I I I t I I I I I I I I I 
mlll-1 GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 

130 140 150 160 170 180 



190 200 210 220 230 240 

alll-l .pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I I I I I I I I i 1 I I I I I 1 I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I 
mlll-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

alll-l. pep GGNTQI I VPLNNRSLATSGDYRI FHVDKSGKRLSHI INPNNKRPI SHNLAS I S VVADSAM 

IIIIIIIIIIIIIIIMillltllllll:lllll Illllllllltllllllllll 

ml 1 1 - 1 GGHTQI I VPLNNRS LATSGDYRI FH VDKNGKRLSHI I NPNNKRP I SHNLAS I S WADS AM 

250 260 270 280 290 300 



310 320 330 340 350 

alll-l .pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 

lillllllllMIIMIIIIIIIIIIIItllllllllllllllllMIIMI 

ml 1 1- 1 TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
310 320 330 340 350 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ DD 405>: 

gll4 . seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCAGCAGG AATGCAGCAA 
51 GACTTTTTTA TGTCCGCCGG GCGGGACGAG TATGGGGCGG TCAATGTCGG 
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328 



101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTCGAA 

151 TACGGTCAAA GCGGCTATTT TACCAGAGCC GCCGAATGTA AAACAGGGTG 

201 TCAGGGCATC AGCCCGAGCT GCCTGAACGA ACGGACGGTT TGCGAGGTAA 

251 CGATAAAATG GTCGAGCAGC GAAACATCAA CCAGCGACAT GGCCTGTGCC 

301 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAAGGTTCAG GCGAGCCGCC 

351 CGGATGGTTG TGCGCGATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

401 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 406; ORF 1 14.ng>: 

gll4 .pep 

1 MASITSPLHG AQQECSKTFL CPPGGTSMGR SM SVTVGLFC VSINLTISV E 

51 YGQSGYFTRA AECKTGCQGI SPSCLNERTV CEVTIKWSSS ETSTSDMACA 

101 SRLVNMMSSC EGSGEPPGWL CAIIRLSAYS SNASLTISRM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 407>: 

mll4.seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCACAGAG AATGCAGCAA 

51 GACTTTTTTA TGTCCACCGG GCGGGACGAG TATAGGGCGG TCAATGTCGG 

101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTTGAA 

151 TACGGTTGAA GCGGCTATTT TATCAGAGCC GCCGCATGTA AAACAGAGTG 

201 TCAGGGCATC AACCCGAGCT GTCTGAACGA ACAGACGCTT TGCGAkGTAA 

251 CGATAAAATG GTCGAGCAGC GACACATCGA CCAGCGACAT TGCCTGTGCC 

301 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAArGTTCsG GCGAGCCGcC 

351 CGgATGGTTG TGCGCAATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

401 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 408; ORF 11 4>: 

ml 14. pep 

1 MASITSPLHG AHRECSKTFL CPPGGTSIGR S MSVTVGLFC VSINLTISV E 

51 YGXSGYFIRA AACKTECQGI NPSCLNEQTL CXVTIKWSSS DTSTSDIACA 

101 SRLVNMMSSC EXSGEPPGWL CAIIRLSAYS SNASLTISRM * 



mll4/gll4 90.0% identity over a 140 aa overlap 

10 20 30 40 50 60 

mll4 . pep MASITSPLHGAHRECSKTFLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSGYFIRA 

llllllllllhHIIIIIIIIIIIIhllllllllllllllllllMIIII INI II 

gll4 MASITSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLTISVEYGQSGYFTRA 
10 20 30 40 50 60 

70 80 90 100 110 120 

mll4 .pep AACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGEPPGWL 
I III lllhlllllhhl lllllllhllllhllllllllllllll llllllli 
gll4 AECKTGCOGISPSCLNERTVCEVTIKWSSSETSTSDMACASRLVNMMSSCEGSGEPPGWL 

70 80 90 100 110 120 



130 140 
mll4 .pep CAIIRLSAYSSNASLTISRMX 

llllllllllllllllillll 
gll4 CAIIRLSAYSSNASLTISRMX 
130 140 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 409>: 

all4 . seq 

1 ATGCCGGAGG CAAGCATCGC CTCCATCACT TCGCCGCTGC ACGGGGCGCA 

51 ACAGGAATGC AGCAAGACTT TTTTATGTCC GCCGGGCGGG ACGAGTATGG 

101 GGCGGTCAAT GTCGGTAACG GTAGGTTTGT TTTGTGTTTC CATTAACTTA 

151 ACGATATCTG TCGAATACGG TTGAAGCGGC TATTTTATCA GAGCCGCCGC 

201 ATGTAAAACA GGGTGTCAGG GCATCAGCCC GAGCTGCCTG MCGAACGGA 

251 CGGTTTGCGC CGTTACGATA AAATGGTCGA GCAGCGACAC ATCGACCAGC 

301 GACATTGCCT GTGCCAGCCG CCTTGTGAAC ATGATGTCTT CCTGCGAAGG 

351 TTCGGGCGAG CCGCCCGGAT GGTTGTGCGC GATAATCAGG CTGTCGGCAT 

401 ATTCGTCCAA TGCCAGTTTG ACAATTTCAC GGATGTAA 
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This corresponds to the amino acid sequence <SEQ ID 410; ORF 1 14.a>: 

all4 .pep 

1 MPEASIASIT SPLHGAQQEC SKTFLCPPGG TSMGRSM SVT VGLFCVSINL 

51 TISV EYG*SG YFIRAAACKT GCQGISPSCL NERTVCAVTI KWSSSDTSTS 

101 DIACASRLVN MMSSCEGSGE PPGWLCAIIR LSAYSSNASL TISRM* 

iiill4/all4 92.9% identity in 140 aa overlap 

10 20 30 40 50 

mll4 .pep MASITSPLHGAHRECSKTFLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSG 
: I I I I I I I I I i :: I I I I I I I I I i I I I I : I I I M I I I I I I I I I I I I I I I I t I I I I I 
all4 MPEASIASITSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLTISVEYGXSG 
10 20 30 40 50 60 

60 70 80 90 100 110 

mll4 .pep YFIRAAACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVKMMSSCEXSGE 
MIIIIMII I I I I I I I I I I I I I I I i I I I I I I I I I I I I I III 

all4 YFIRAAACKTGCQGISPSCLNERTVCAVTIKWSSSDTSTSDIACASRLVNMMSSCEGSGE 
70 80 90 100 110 120 



120 130 140 

mll4 .pep PPGWLCAIIRLSAYSSNASLTISRMX 
I I II I I I II I t I I II I I I I II II I II 
all4 PPGWLCAIIRLSAYSSNASLTISRMX 
130 140 



The following partial DNA sequence was 


gll7 . seq 






1 


atggtcgacg 


aactcgacCT 


51 


TGCCGACATC 


GGACGCTACG 


101 


GCTGCAACAG 


CACCGTCGCC 


151 


AAACTTACCC 


ACTTCGCCCG 


201 


CGCACAGCAA 


GCGGAAACCA 


251 


Acatccgcgt 


cgtaTTAATC 


301 


ttTTtaaOCA 


ACGCCCCCGA 


351 


aacccTCGAC 


ATCTTCGCCC 


401 


TCAAATGGCA 


GCTCGAAGAT 


451 


TACCGCGAAA 


TCGCCCTGCT 


501 


ATACATCGAA 


AACTTCCTCG 


551 


ATATCCACTT 


TGAAGTCGCC 


601 


AAAAAAATGG 


TGAAGAAAAA 


651 


CGCCGTGCGG 


ATTCTGGTCG 


701 


gcaTCGTCCA 


CAGCCTCTGG 


751 


ATCGCCAACC 


CCAAAGgcaA 


801 


cggcccGGAa 


gacaaaggtg 


851 


accAATTCaa 


CgaatTcggT 


901 


ggcaaaggcg 


attccGCCtA 


951 


CTTGGACTGG 


CGCGAAAATA 


1001 


CCGCCTTCAA 


AACCGAGCTT 


1051 


CACGGC/^G 


TCCTCTCTCT 


1101 


CTACGCCCTG 


CACAGCAGCA 


1151 


AaggGCAGAT 


TGTGCCGCTG 


1201 


GAAATcatta 


cCGCcaaAGA 


1251 


AGGctgGGtC 


aAATCCGGCA 


1301 


GCCAGcaaAa 


cgCcgaCACC 


1351 


AAGCAGCTTG 


CCAAACTCAC 


1401 


aaATCTCGGC 


tacaaAAAGC 


1451 


gcgaaatttc 


caaccgcgcc 


1501 


ccgccccCCG 


TGCCCGTCAG 



identified in A^. gonorrhoeae <SEQ ID 41 1>: 

GCTGCCCGAT GCCGTCGCCG CCACCCTGCT 
TCCCCGATTG GAACCTATTG GTTTCCGAGC 
GAGCTGGTCA AAGGTGtgga CGAAGTGCAG 
GGTGGACAGC CTCGCCACGC CGGAAGAACG 
TGCGGAAAAT GCTGCTGGCg atggttaccg 
AAACTGGCGA TGCGTacgcg caccCTGcta 
CAGCCCTGAA AAACgcgccG TCgccaaAga 
CGCTCGCCAA CCGCTTGGGC GTGTGGCAGC 
TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 
TTTGGACGAA AAACGCACCG AACGCCTCGA 
ATATCCTGCG TACGGAACTC AAAAAATACA 
GGCCGTCCGA AACACATCTA CTCCATTTAC 
ACTCAGCTTC GACGgccTGT TCGACATCCG 
ATACCGTCCC CGaGTGTTAC ACCACGCTGG 
CAGCCCATTC CCGGCGagtt CGAcgactAC 
CGgttATAAA AGtTTGCACA CCGTCATCGT 
tggaaGtgCA AATCCGCACC TTCGAtatGC 
gtcgccgCCC ACTGGCGtta caaagaaggc 
cgaacaaAAA ATcgccTggt TGCgccaACT 
TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 
TTCAACGACA CGATTTATGT TTTGACCCCG 
GCCAACGGGC GCAACCCCCA TCGACTTCGC 
TcggCGACCG CTGCCGGGGC GCGAAAGTCG 
TCCACCCCGC TCGAAAACGG ACAGCGCGTC 
AGGGCATCCT TCCGTCAACT GGCTTTACGA 
AGGCCATCGG caaAATCCGC GCCTAcatCC 
GTGCGCGAAG AAGGCCGTGT CCAACTCGAC 
GCCCAAACCC AACCTGCAAG AGCTTgccga 
cagaagacct ctacacCGCc gtcggacaag 
atCcaaaaag cctgcggcac GCTgaacgaa 
CGCAACCACC ATCGTCAAAC AGTCCAAAAT 
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1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 

This corresponds to the amino acid sequence <SEQ ID 412; ORF 1 17.ng>: 

gll7 .pep 

1 MVDELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMRK MLLA MVTDIRWLI KLAM RTRTLL 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIAIiLLDE KRTERLEYIE NFLDILRTEL KKYNIHFEVA GRPKHIYSIY 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

251 lANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

301 GKGDSAYEQK lAWLRQLLDW RENM7VESGKE DLAAAFKTEL FNDTIYVLTP 

351 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 

4 01 EIITAKEGHP SVNWLYEGWV KSGKAIGKIR AYIRQQNADT VREEGRVQLD 

4 51 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE 

501 PPPVPVSATT IVKQSKIKKG GKTGVLIDGE DGLMTTLAKC CKPAPPDDIA 

551 GFVTRERGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 

601 RAQDRSGLLR DVSDALARHK LNVTAVQTQS RDLEASMRFT LEVKQVNDLP 

651 RVLAGLGDVK GVLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 413>: 

mll7.seq (partial) 

1 . . GTGAAACTCA AGAAATACAA TGTCCATTTC GAAGTCGCCG GCCGCCCGAA 

51 ACACATCTAC TCCATTTACA AAAAAATGGT GAAGAAAAAA CTCAGCTTCG 

101 ACGGCCTCTT TGACATCCGC GCCGTGCGAA TTCTGGTTGA TACCGTCCCC 

151 GAGTGTTACA CCACGCTGGG TATCGTCCAC AGCCTCTGGC AGCCCATTCC 

201 CGGCGAGTTC GACGACTACA TCGCCAATCC CAAAGGCAAC GGCTATAAAA 

251 GTTTGCACAC CGTCATCGTC GGCCCGGAAG ACAAAGGCGT GGAAGTACAA 

301 ATCCGCACCT TCGATATGCA CCAATTCAAC GAATTCGGTG TCGCCGCCCA 

351 CTGgCGTTAC AAAGAGGGCG GCAAGGGCGA TTCCGCCTAC GAACAGAAAA 

401 TCGCCTGGTT GCGCCAACTC TTGGACTGGC GCGAAAACAT GGCGGAAAGC 

451 GGCAAGGAAG ACCTCGCCGC CGCCTTCAAA ACCGAGCTTT TCAACGACAC 

501 GATTTATGTT TTGACCCCGC ACGGCAAAGT CCTCTCCCTG CCCACGGGCG 

551 CGACCCCCAT CGACTTCGCC TACGCCCTGC ACAGCAGCAT CGGCGACCGT 

601 TGCCGCGGTG CGAAAGTCGA AGGGCAGATT GTGCCGCTGT CCACCCCGCT 

651 CGAAAACGGA CAGCGCGTCG AAATCATTAC CGCCAAAGAA GGGCATCCTT 

701 CCGTCAACTG GCTTTACGAA GGCTGGGTCA AATCCAACAA GGCAATCGGC 

751 AAAATCCGCG CCTACATCCG CCAGCAAAAC GCCGACACCG TGCGCGAAGA 

801 AGGCCGCGTC CAACTCGACA AACAGCTTGC CAAACTCACG CCCAAACCCA 

851 ACCTGCAAGA GCTTGCCGAA AATCTCGGCT ACAAAAAGCC AGAAGACCTC 

901 TACACCGCCG TCGGACAAGG CGAAATTTCC AACCGCGCCA TCCAAAAAGC 

951 CTGCGGCACg CTGAACGAAC CGCCGCCCGT ACCCGTCAGC GAAACCACCA 

1001 TCGTCAAACA GTCCAAAATC AAAAAAGGCG GCAAAAACGG CGTGCTCATC 

1051 GACGGCGAAG ACGGTCTGAT GACCACGCTT GCCAAATGCT GCAAACCCGC 

1101 GCCGCCCGAC GATATTATCG GCTTCGTTAC CCGCGAGCGC GgCATTTCAG 

1151 TGCACCGCAA AwyyTkCyCG TCTTTCCAAC ACCTCGCCGA ACACGCGCCC 

1201 GAwAAAGTGC TGGACGCAAG CTGGGCGGCA TTGCAGGAAG GACAAGTATT 

1251 CGCCGTCGAT ATCGAAATCC GCGCCCAAGA CCGCTCCGGG CTTTTGCGCG 

13 01 ACGTATCCGA CGCGCTCGCC CGCCACAAAC TCAACGTTAC CGCCGTGCAA 
1351 ACCCAGTCCC GCGACTTGGA AGCCAGCATG AGGTTCACGC TCGAAGTCAA 

14 01 ACAAGTCAAC GACCTCCCGC GCGTCCTCGC CAGCCTCGGC GACGTCAAAG 
14 51 GCGTATTGAG CGTTACCCGG CTTTAA 

This corresponds to the amino acid sequence <SEQ ID 414; ORF 1 17>: 

mll7 .pep (partial) 
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CAAAAAAGGT GGCAAAACCG GCGTGCTCAT CGACGGCGAA GACGGCTTGA 
TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGATATTGCC 
GGCTTCGTTA CCCGCGAGCG CGGCATTTCC GTCCACCGCA AAACCTGCCC 
CTCTTTCCGA CACCTTGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 
GTTGGGCGGC GTTGCAGGAA GGGCAAGTGT TCGCCGTCGA TATCGAAATC 
CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 
CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 
AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGtCAA CGacCTCCCG 
CGCGTCCTCG CCGGCCTCGG CGATGTCAAA GGCGTATTGA GCGTTACCCG 
GCTTTAA 
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1. . . .VKLKKYNVHF EVAGRPKHIY SIYKKMVKKK LSFDGLFDIR AVRILVDTVP 

51 ECYTTLGIVH SLWQPIPGEF DDYIANPKGN GYKSLHTVIV GPEDKGVEVQ 

101 IRTFDMHQFN EFGVAAHWRY KEGGKGDSAY EQKIAWLRQL LDWRENMAES 

151 GKEDLAAAFK TELFNDTIYV LTPHGKVLSL PTGATPIDFA YALHSSIGDR 

201 CRGAKVEGQI VPLSTPLENG QRVEIITAKE GHPSVNWLYE GWVKSNKAIG 

251 KIRAYIRQQN ADTVREEGRV QLDKQLAKLT PKPNLQELAE NLGYKKPEDL 

301 YTAVGQGEIS NRAIQKACGT LNEPPPVPVS ETTIVKQSKI KKGGKNGVLI 

351 DGEDGLMTTL AKCCKPAPPD DIIGFVTRER GISVHRKXXX SFQHLAEHAP 

401 XKVLDASWAA LQEGQVFAVD lEIRAQDRSG LLRDVSDALA RHKLNVTAVQ 

451 TQSRDLEASM RFTLEVKQVN DLPRVLASLG DVKGVLSVTR L* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 117 shows 97.6% identity over a 490 aa overlap with a predicted ORF (ORF 1 17.ng) 
from N. gonorrhoeae: 

mll7/gll7 

10 20 30 

mll7 .pep VKLKKYNVHFEVAGRPKHIYSIYKKMVKKKL 

Hlllhlllllllllllllllllllllll 
gll7 EKYREIALLLDEKRTERLEYIENFLDILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKL 
150 160 170 180 190 200 

40 50 60 70 80 90 

mll7 . pep SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
gll7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
210 220 230 240 250 260 

100 110 120 130 140 150 

mll7 . pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLIiDWRENMAESG 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

gll7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 

160 170 180 190 200 210 

mll7 . pep KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIill 

gll7 KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 

220 230 240 250 260 270 

ml 1 7 . pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 

lllllllllllilMlllllllillllllllllhlllllllllllllllllllllllll 
gll7 PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSGKAIGKIRAYIRQQNADTVREEGRVQ 
390 400 410 420 430 440 



ml 17 .pep 
gll7 



280 290 300 310 320 330 

LDKQIiAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIII 

LDKOLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSA 
450 460 470 480 490 500 



340 350 360 370 380 390 

mll7 .pep TTIVKQSKIKKGGKNGVLIDGEDGLMTTLiAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 
lllllllllllllhllllllllllllllllllllllllll llllllllllllll: I 
gll7 TTIVKQSKIKKGGKTGVLIDGEDGLMTTLAKCCKPAPPDDIAGFVTRERGISVHRKTCPS 
510 520 530 540 550 560 



mll7 .pep 



400 410 420 430 440 450 

FQHIAEHAPXKVLDAS WAALQEGQVFAVDI E I RAQDRSGLLRDVSDALARHKLNVTAVQT 
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hlllllll IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII 

FTUil^HAPEKVIJDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 

460 470 480 490 

QSRDLEASMRFTLEVKQVNDLPRVIiASLGDVKGVLSVTRLX 

llllillllllllllllllllllllhllllllllllllll 

QSRDLEASMRFTLEVKQVNDLPRVLAGLGDVKGVLSVTRLX 
630 640 650 660 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 415>: 
all7 . seq 

1 ATGGTTCATG AACTCGACCT GCTCCCCGAT GCCGTCGCCG CCACCCTGCT 

51 TGCCGACATC GGACGCTACG TCCCCGACTG GAACCTATTG GTTTCCGAAC 

101 GCTGCAACAG TACCGTCGCC GAGCTGGTCA AAGGTGTGGA CGAAGTGCAG 

151 AAACTCACCC ACTTCGCCCG GGTGGACAGC CTCGCCACGC CGGAAGAACG 

201 CGCCCAGCAG GCAGAAACTA TGCGGAAAAT GCTGCTGGCG ATGGTTACCG 

251 ACATCCGCGT CGTGTTAATC AAACTGGCGA TGCGTACGCG CACCCTGCAA 

301 TTTTTAAGCA ACGCCCCCGA CAGCCCCGAA AAACGCGCCG TCGCCAAAGA 

351 AACCCTCGAC ATCTTCGCCC CGCTCGCCAA CCGTTTGGGC GTGTGGCAGC 

4 01 TCAAATGGCA GCTCGAAGAT TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 

451 TACCGCGAAA TCGCCCTGCT TTTGGACGAA AAACGCACCG AACGCCTCGA 

501 ATACATCGAA AACTTCCTTA ATATCCTGCG TACGGAACTC AAAAAATACA 

551 ATATCCACTT TGAAGTCGCC GGCCGTCCGA AACACATCTA CTCCATTTAC 

601 AAAAAAATGG TGAAGAAAAA ACTCAGCTTC GACGGGTTGT TCGACATCCG 

651 CGCCGTGCGG ATTCTGGTTG ATACCGTCCC CGAGTGTTAC ACCACACTGG 

701 GCATTGTCCA CAGCCTCTGG CAGCCCATTC CCGGCGAGTT CGACGACTAC 

751 ATCGCCAACC CGAAAGGCAA CGGCTATAAA AGTTTGCACA CCGTCATCGT 

801 CGGCCCGGAA GACAAAGGCG TGGAAGTGCA AATCCGCACC TTCGATATGC 

851 ACCAATTCAA CGAATTCGGT GTCGCCGCGC ACTGGCGTTA CAAAGAGGGC 

901 GGCAAAGGCG ATTCCGCCTA CGAACAAAAA ATCGCCTGGT TACGCCAACT 

951 TTTGGACTGG CGCGAA/UVCA TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 

1001 CCGCCTTCAA AACCGAGCTT TTCAACGACA CGATTTATGT TTTGACCCCG 

1051 CACGGCAAAG TCCTCTCCCT GCCCACAGGC GCGACCCCCA TCGACTTCGC 

1101 CTACGCCCTG CACAGCAGCA TCGGCGACCG TTGCCGCGGT GCGAAAGTCG 

1151 AAGGGCAGAT TGTGCCGCTG TCCACCCCGC TCGAAAACGG ACAGCGTGTC 

1201 GAAATCATTA CCGCCAAAGA AGGGCATCCT TCCGTCAACT GGCTTTACGA 

1251 AGGCTGGGTC AAATCCAACA AGGCAATCGG CAAAATCCGC GCCTACATCC 

1301 GCCAGCAAAA CGCCGACACC GTGCGCGAAG AAGGCCGCGT CCAACTCGAC 

1351 AAACAGCTTG CCAAACTCAC GCCCAAACCC AACCTGCAAG AGCTTGCCGA 

1401 AAATCTCGGC TACAAAAAGC CAGAAGACCT CTACACCGCC GTCGGACAAG 

1451 GCGAAATTTC CAACCGCGCC ATCCAAAAAG CCTGCGGCAC GCTGAACG7VA 

1501 CCGCCGCCCG TACCCGTCAG CGAAACCACC ATCGTCAAAC AGTCCAAAAT 

1551 CAAAAAAGGC GGCAAAAACG GCGTGCTCAT CGACGGCGAA GACGGTCTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGACATTGTC 

1651 GGCTTCGTTA CCCGCGATCG CGGCATTTCG GTACACCGCA AAACCTGCCC 

1701 CTCTTTCCGA GACCTCGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

1751 GTTGGGCGGC GTTGCAGGAA GGACAAGTGT TCGCCGTCGA TATCGAAATC 

1801 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

1851 CCGCCACA/^ CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGTTAC CGACCTCCCA 

1951 CGCGTCCTCG CCAGCCTCGG CGACGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This conresponds to the amino acid sequence <SEQ ID 416; ORF 117.a>: 

all7 .pep 

1 MVHELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMR KMLLA MVTDIRWLI KLAM RTRTLQ 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLNILRTEL KKYNIHFEVA GRPKHIYSIY 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

251 lANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

301 GKGDSAYEQK lAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

351 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 



gll7 

mll7 .pep 
gll7 
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401 EIITAKEGHP SVNWLYEGWV KSNKAIGKIR AYIRQQNADT VREEGRVQLD 

4 51 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE 

501 PPPVPVSETT IVKQSKIKKG GKNGVLIDGE DGLMTTLAKC CKPAPPDDIV 

551 GFVTRDRGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 

601 RAQDRSGLLR DVSDALARHK LNVTAVQTQS RDLEASMRFT LEVKQVTDLP 

651 RVLASLGDVK GVLSVTRL* 



mll7/all7 98.0% identity in 4 90 aa overlap 



10 20 30 

mil? .pep VKLKKYNVHFEVAGRPKHIYSIYKKMVKKKL 

: I I I I I : I I I I I I t M I I I I I I I I I I I I I I 
all? EKYREIALLLDEKRTERLEYIENFLNILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKL 
150 160 170 180 190 200 

40 50 60 70 80 90 

mll7.pep SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPXPGEFDDYIANPKGNGYKSLHTVIVG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i M I I I I I I I I I I I I ! I I i I I I I I 
all? SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
210 220 230 240 250 260 



100 110 120 130 140 150 

rail 7 . pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I i I I I I I I I I 
all7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 



160 170 180 190 200 210 

mll7 . pep KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 

I i 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I j 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 r 

all7 KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 



220 230 240 250 260 270 

mll7 .pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 

I I I I I I I I I I I I I I I 1 I I M i I I I I I I I I I I 1 i I I I I I I I I I I I M I I I i 1 I I I I i I I i I 
all7 PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 
390 400 410 420 430 440 



280 290 300 310 320 330 

ml 17. pep LDKQLiAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
I I I I I M I I I I I I I I I I I I I I I t I I I I I I I I I I I i I I I I I I I I I I I I I I I I 1 I I I i I I I I 
all7 LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
450 460 470 480 490 500 



340 350 360 370 380 390 

mll7 . pep TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 
I t I I I I I I I I I I I I i I I I I i I I I I I I I 1 I I I I I I I I I I I I I : I I I I I : i I I I I I I I : I 
all7 TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPS 
510 520 530 540 550 560 



400 410 420 430 440 450 

mll7 . pep FQHLAEHAPXKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 

1:1111111 I I I I t I I I I I I I I I I t I I I I I I t I I I I I I I I I I I I I I I I I I I I I 

all7 FRHLAEHAPEKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 



460 470 480 490 

ml 17 . pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 
I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 1 M I I 
all7 QSRDLEASMRFTLEVKQVTDLPRVLASLGDVKGVLSVTRLX 
' 630 640 650 660 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 417>: 

gll7-l.seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CCCTGCAAGA 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTCGCTGG CGCAGGAACA TTATCCTGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTCGACGAAC TCGACCTGCT GCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGATTGGAA CCTATTGGTT 

301 TCCGAGCGCT GCAACAGCAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTTACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

401 AAGAACGCGC ACAGCAAGCG GAAACCATGC GGAAAATGCT GCTGGCGATG 

4 51 GTTACCGACA TCCGCGTCGT ATTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCTGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG CTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCGATA TCCTGCGTAC GGAACTCAAA 

751 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTGTTCG 

851 ACATCCGCGC CGTGCGGATT CTGGTCGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGCA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGagttCGA 

951 cgactACATC GCCAACCCCA AAGgcaACGg ttATAAAAGt TTGCACACCG 

1001 TCATCGTcgg cccGGAagaa aaaggtgtgg aagtgcAAAT CCGCACCTTC 

1051 GATATGCacc AATTCaaCga ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAAGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAATATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCTCTGCC AACGGGCGCA ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGCTG CCGGGGCGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCJVACTGGC 

1451 TTTACGAAGG CTGGGTCAAA TCCGGCAAGG CCATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGTGTCCA 

1551 ACTCGACAAG CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAZVAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTGC CCGTCAGCGC AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGTGGC AAAACCGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGCTTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 TATTGCCGGC TTCGTTACCC GCGAGCGCGG CATTTCCGTC CACCGCAAAA 

1901 CCTGCCCCTC TTTCCGACAC CTTGCCGAAC ACGCGCCCGA AAAAGTACTG 

1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGG CAAGTGTTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCG GCCTCGGCGA TGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 418; ORF 117-l.ng>: 

gll7-l.pep 

1 MTAISPIQDT QSATLQELRE WFDSYCAALP DNDKNLIGTA WSLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VDELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRK MLLAM 

151 VTDIRVVLIK LAM RTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLDILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPEE KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

4 51 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SGKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSATTI VKQSKIKKGG KTGVLIDGED 

601 GLMTTLAKCC KPAPPDDIAG FVTRERGISV HRKTCPSFRH LAEHAPEKVL 

651 DASWAALQEG QVFAVDXEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVNDLPR VLAGLGDVKG VLSVTRL* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 41 9>: 

II1117-1 . seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 
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51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTTGCTGG CGCAGGAACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

401 AAGAACGCGC CCAGCAGGCA . GAAACTATGC GGAAAATGCT GCTGGCGATG 

451 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAA6ATTTG GGCTTCCGCC ATCAAAAGCC 

651 CGAAAAATAC CGCGAAATCG CGCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCAACA TCCTGCGCGG TGAACTCAAG 

751 AAATACAATG TCCATTTCGA AGTCGCCGGC CGCCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTCTTTG 

851 ACATCCGCGC CGTGCGAATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGTA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAATCCCA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTACAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAGGGCGGC AAGGGCGATT CCGCCTACGA ACAGAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACGGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

1451 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 TATTATCGGC TTCGTTACCC GCGAGCGCGG CATTTCAGTG CACCGCAAAA 

1901 CCTGCCCGTC TTTCCAACAC CTCGCCGAAC ACGCGCCCGA AAAAGTGCTG 

1951 GACGCAAGCT GGGCGGCATT GCAGGAAGGA CAAGTATTCG CCGTCGATAT 

2001 CGAAATCCGC GCCC7UVGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 420; ORF 1 17-1>: 

mll7-l.pep 

1 MTAISPIQDT QSATLQELRE WFDSYCAALP DNDKNLIGTA WLLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRK MLLAM 

151 VTDIRVVLIK LAM RTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQKPEKY REIALLLDEK RTERLEYIEN FLNILRGELK 

251 KYNVHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPED KGVEVQTRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

451 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SNKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKIKKGG KNGVLIDGED 

601 GLMTTLAKCC KPAPPDDIIG FVTRERGISV HRKTCPSFQH LAEHAPEKVL 

651 DASWT^LQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVNDLPR VLASLGDVKG VLSVTRL* 

mll7-l/gll7-l 98.2% identity in 737 aa overlap 

10 20 30 40 50 60 

ml 17-1. pep MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
llllllllllllltlllMlllllllllltlllillllltl IMIItlllltlltllll 
gll7-l MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWSLAQEHYPADAATPYGEPL 

10 20 30 40 50 60 



70 



80 90 100 110 120 
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min-l . pep PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I 
gll7-l PDHFLGAAQMVDELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
70 80 90 100 110 120 

130 140 150 160 170 180 

mll7-l.pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
I I M I I I I i I I I I I I I I I I I I I I I I i I I t I I I t I I I I i I I I I 1 I I I I I I I I I I I I I I I I I 
g 1 1 7 - 1 LTH FARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLI KLAMRTRTLQFLSNAPDS PEK 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 1 7 - 1 . pep RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREI ALLLDEKRTERLE YIEN 
I I I I I I I I I I i i I I I I I I I t I t I I I I I I t I I I I I I : I I I I I I I I I I I I I I I I I I I I i I i I 
g 1 1 7- 1 RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEKYREI ALLLDEKRTERLE YIEN 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 17-1. pep FLNILRGELKKYNVHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 
11:111 I I I I i I : I 1 t I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gll7-l FLDILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 1 7- 1 . pep TLGI VHSLWQPI PGEFDDYI ANPKGNGYKSLHTVI VGPEDKGVEVQIRTFDMHQFNEFGV 
I I I I I I I I I I I M I I I I I t I I I I I I I I I I I I n I I I I I I : I I I I I I t I I I I I I I I I I I I I 
gll7-l TLGI VHSLWQPI PGEFDDYI ANPKGNGYKSLHTVIVGPEEKGVEVQIRTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

mll7-l . pep AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 
I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I i I I I I i I I I I i I I I I I I i I I I I I I I I 
gll7-l AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 

370 380 390 400 410 420 

430 440 ^ 450 460 470 480 

rall7-l.pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I I I i I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I 
gll7-l GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 17- 1 . pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I I I I I I I I I t I : I I I I I I I I I I I I I I I I I I I I I I I I i I I t I I I I I I I I I I I I M t I I I I I 
gll7-l VNWLYEGWVKSGKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 17-1. pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 
I I 1 M t I I I I I I I I I I I I I 1 I 1 I I I I I I I I I I I I I t llllllllllllihIIIIIMI 
gll7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSATTIVKQSKIKKGGKTGVLIDGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

mil 7-1. pep GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I 1 I I I I I I I I t I I I I I I lltilllllllllllllll:||||lll|||||||||||||| 
gll7-l GLMTTLAKCCKPAPPDDIAGFVTRERGISVHRKTCPSFRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

mil 7-1 . pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I M I I 

gll7-l QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
670 680 690 700 710 720 

730 

ml 17-1. pep VLASLGDVKG VLS VTRLX 
I I !: I I I I I t I I I I I I I I 
gll7-l VLAGLGDVKG VLS VTRLX 

730 



mll7-l/RelA 
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sp|P55133|RELA_VIBSS GTP PYROPHOSPHOKINASE {ATP:GTP 3 ' -PYROPHOSPHOTRANSFERASE) (PPGPP 
SYNTHETASE I) >gi 1537617 {U13769) ppGpp synthetase I [Vibrio sp.) Length « 744 
Score « 536 bits (1366), Expect » e-151 

Identities « 288/685 (42%), Positives = 432/685 (63%), Gaps = 31/685 (4%) 

Query: 74 LDLLPDAVAATLLADI— GRYVPDWNLLVSERCNSTVAELVKGVDEVQKLTHFARVDSL 130 

L+D+ALL+ GYD +E+T+ LV+GV+++ ++ ++ S 
Sbjct: 68 LSMDADTLIAALLYPLVEGGCYSTD ALKEEYSGTILHLVQGVEQMCAIS QLKST 121 

Query: 131 ATPEERAQQAETMRKMLLAMVTDIRWLIKLAMRTRTLQFLSNAPDSPEKRAVAKETLDI 190 

A +A Q + +R+MLL+MV D R V+IKLA R L+ + + PD +RA A+E +1 
Sbjct: 122 AEETAQAAQVDNIRRMLLSMVDDFRCWIKLAERICNLREVKDQPDEV-RRAAAQECANI 180 

Query: 191 FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIENFLNILRGELK 250 

+APLANRLG+ QLKW++ED FR+Q P+ Y++IA L E+R +R +YI +F++ L +K 
Sbjct: 181 YAPLANRLGIGQLKWEIEDYAFRYQHPDTYKQIAKQLSERRIDREDYITHFVDDLSDAMK 240 

Query: 251 KYNVHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQ 310 

N+ EV GRPKHiySI++KM KK L FD LFD+RAVRI+ + + +CY LG+VH+ ++ 
Sbjct: 241 ASNIRAEVQGRPKHIYSIWRKMQKKSLEFDELFDVRAVRIVAEELQDCYAALGWHTKYR 300 

Query: 311 PIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFD^3HQFNEFGVAAHWRYKEG- 369 

+P EFDDY+ANPK NGY+S+HTV++GPE K +E+QIRT MH+ +E GVAAHW+YKEG 
Sbjct: 301 HLPKEFDDYVANPKPNGYQSIHTVVLGPEGKTIEIQIRTKQMHEESELGVAAHWKYKEGT 360 

Query: 370 — GKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPHGKVLSLP 427 

G SAY++KI WLR+LL W+E M++SG ++ ++++F+D +Y TP G V+ LP 
Sbjct: 361 ASGGAQSAYDEKINWLRKLLAWQEEMSDSG—EMLDELRSQVFDDRVYAFTPKGDWDLP 418 

Query: 428 TGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPSVNWLYE- 486 

+ ATP+DFAY +HS +G RC GAKVEG+IVP + L+ G +VEIIT KE +PS +WL 
Sbjct: 419 SNATPLDFAYHIHSEVGHRCIGAKVEGRIVPFTYHLQMGDQVEIITQKEPNPSRDWLNPN 478 

Query: 487 -GWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKL— TPKPNLQELAENLGYKKP 543 

G+V S++A K+ A+ R+Q+ D G+ L+ +L K+ T K A+ K P 

Sbjct: 479 LGFVTSSRARAKVHAWFRKQDRDKNIIAGKEILEAELVKIHATLKDAQYYAAKRFNVKSP 538 

Query: 544 EDLYTAVGQGEIS-NRAIQKACGTLNEPPPVPVSETTIVKQSKI KKGGKNGV 594 

E+LY +G G++ N+ I +N+P + + K S+ KK ++ V 

Sbjct: 539 EELYAGIGSGDLRINQVINHINALVNKPTAEEEDQQLLEKLSEASNKQATSHKKPQRDAV 598 

Query: 595 LIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASW 654 

+++G D LMT LA+CC+P P DDI GFVT+ RGISVHR C + L HAPE++-t-D W 
Sbjct: 599 VVEGVDNLMTHLARCCQPIPGDDIQGFVTQGRGISVHRMDCEQLEELRHHAPERIIDTVW 658 

Query: 655 AALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQ— SRDLEASMRFTLEV 712 

G + + + + A +R+GLL+++++ L K+ V ++++ + + M F LE+ 
Sbjct: 659 GGGFVGN-YTITVRVTASERNGLLKELTNTLMNEKVKVAGMKSRVDYKKQMSIMDFELEL 717 

Query: 713 KQVNDLPRVLASLGDVKGVLSVTRL 737 

+ L RVL + VK V RL 
Sbjct: 718 TDLEVLGRVLKRIEQVKDVAEAKRL 742 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 421>: 

all7-1.8oq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCAC CGCGCTGCCG AACAACGATA 

101 AAAAACTTGT CTTAGCCGCC CGTTCGCTGG CGGAAGCACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAACCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

401 AAGAACGCGC CCAGCAGGCA GAAACTATGC GGAAAATGCT GCTGGCGATG 

451 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTTAATA TCCTGCGTAC GGAACTCAAA 

751 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGGTTGTTCG 
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851 ACATCCGCGC CGTGCGGATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACACTGGGCA TTGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAACCCGA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTGCAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCGCACT GGCGTTAC7UV 

1101 AGAGGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTAC 

1151 GCCAACTTTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACAGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGTGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

1451 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 CATTGTCGGC TTCGTTACCC GCGATCGCGG CATTTCGGTA CACCGCAAAA 

1901 CCTGCCCCTC TTTCCGACAC CTCGCCGAAC ACGCGCCCGA AAAAGTACTG 

1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGA CAAGTGTTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTTACCGA 

2151 CCTCCCACGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 422; ORF 1 17-1 .a>: 

all7-l.pep 

1 MTAISPIQDT QSATLQELRE WFDSYCTALP NNDKKLVLAA RSLAEAHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LAMRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLNILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPED KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF KDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

451 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SNKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKIKKGG KNGVLIDGED 

601 GLMTTLAKCC KPAPPDDIVG FVTRDRGISV HRKTCPSFRH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVTDLPR VLASLGDVKG VLSVTRL* 

all7-l/mll7-l 97.7% identity in 737 aa overlap 

10 20 30 40 50 60 

ml 17-1. pep MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I : I I I : I : : I I I : I I i i I I I I I I I I I I 
all7-l MTAISPIQDTQSATLQELREWFDSYCTALPNNDKKLVLAARSLAEAHYPADAATPYGEPL 

10 20 30 40 50 60 

70 80 90 100 110 120 

mll7-l . pep PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
i t I I I I I I M I I I I I I I 1 I I I I I I I I I I I I I I I i I I I I I I I t t I I I I I I 1 M I I I I I I I I 
all7-l PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 17-1 . pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRWLIKLAMRTRTLQFLSNAPDSPEK 

IIIIIIIMIMIIIIIIMMIIIIIIMIIIIIMIMI I I I I I I I I M I I I I 

all7-l LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRWLIKLAMRTRTLQFLSNAPDSPEK 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 1 7 - 1 . pep RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQKPEKYRE I ALLLDEKRTERLE YIEN 

illllllllllllllllllllMIIIIIMItllhllilllllllllllllllllllll 

al 1 7 - 1 RAVAKETLDI FAPLANRLG VWQLKWQLEDLGFRHQE PEKYRE I ALLLDEKRT ERLE Y I EN 

190 200 210 220 230 240 
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250 260 270 280 290 300 

mll7-l.pep FLNILRGELKKYNVHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIEUVVRILVDTVPECYT 
I I I I I I I I I I t I : I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I t I I I I I I I I I 
all7-l FLNILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 17-1. pep TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 
I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I ) I I I I I I I I I 1 I I I I I I I I t I I I 
all7-l TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 17-1 . pep AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 
IMIIIIIIMIIilltllllllllllltllllllttllllMIIIIIMItllllllf I 
all7-l AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 

370 380 390 400 410 420 

430 440 450 460 470 480 

mll7-l.pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I i I I I I I i I I I I I I I I 
all 7-1 GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 

430 440 450 460 470 480 

490 500 510 520 530 540 

mll7-l . pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I I I I I I I I t I j i I t I I I I I t I I i I I I I I I I I I I i I I I I I I I I I I I I I I i I I I I I I I I I I i 
a 11 7 - 1 VNWLYEGWVKSNKAIGKI RAY I RQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 17-1. pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 
I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I i I I I I I I I I I 
all7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

mll7-l.pep GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I I I I I I I I I I I I I t I I I : I I I I I : I I I I I I I I I I I I I : I I I I I I I I t I I I 1 I I I I I I I I 
all7-l GLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPSFRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

ml 17-1 . pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I i I 1 I I I I I I I I I : I I I I 
all7-l QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVTDLPR 

670 680 690 700 710 720 

730 

mll7-l .pep VLASLGDVKGVLSVTRLX 
I i I I I I I I I t I I I I I I I I 
all7-l VLASLGDVKGVLSVTRLX 
730 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 423>: 

gllB . seq 

1 ATGTGCGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGCTG GAGAATGATT TAATcgaGGT TAGGAGAAAA 

151 TATCCTTATC CGATGGATAT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGCGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCAC7VACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAaCA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 424; ORF 1 18.ng>: 

gllS.pep 

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRRK 

51 YPYPMDIPRD IVIGIGTIID FLMVPNW ELF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 425>: 

mll8 . seq 

1 ATGTGTGAGT TCAAGGATAT TATAAGAAAC GTTCCTTATT TTGAGGGGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGTTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGACAT ACCAAGATAT GTTGTCATTG GAATCGGTAC 

201 CATTATTGAT TTCTTAATGG TTCCAAATTG GAAACTTTTT GA7VATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 426; ORF 11 8>: 

mll8 .pep 

1 MCEFKDIIRN VPYFEGYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRY WIGIGTIID FLMVPNW KLF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. ^gonorrhoeae 

ORF 1 18 shows 92.8% identity over a 125 aa overlap with a predicted ORF (ORF 1 18.ng) 
from A^. gonorrhoeae: 

mlie/gllS 

10 20 30 40 50 60 

MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRY 
I I I j I I : 11:1 II II II I II I I I II II M M I I I I I II I I I I II I : II II I II II I 
MCEFKDFRRNIPCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRRKYPYPMDIPRD 
10 20 30 40 50 60 

70 80 90 100 110 120. 

WIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 
: II I I I I I i I I I I I I I I : I I I I I I I II I I I I I I I I I I I I I I I I I I I i I I I I I I I II I I I I 
IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 
70 80 90 100 110 120 



mll8 .pep 
gll8 

mll8 .pep 
gll8 



ml 18. pep YYNKKX 
I I I I I I 

gllB YYNKKX 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 427>: 

alls . seq 

1 ATGTGTGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGMGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAATTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGATAT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ED 428; ORF 118.a>: 
alld.pep 

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRD IVIGIGTIID FLMVPNW ELF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 



iull8/all8 93.6% identity in 125 aa overlap 

10 20 30 40 50 60 

mllS.pep MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRY 

I I I t I I : 11:1 II II II I I I I II II II II I I II II I II I I I II I I M t II M II I I 
all8 MCEFKDFRRNIPCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRD 

10 20 30 40 50 60 



wo 99/57280 




PCT/US99/09346 



70 80 90 100 110 120 

mllS.pep VVIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 
: I I I I I I I I I I I I I I I I : I I I M I I I n I I I I I I I I I I I I I I I I II [ I I I I I I I I 11 I 1 I 
all8 IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 

70 80 90 100 110 120 



ml 18. pep YYNKKX 
I I I I I I 



all8 YYNKKX 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 429>: 

gl20 . seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAAGGCTACC CCAATCCGCC GTGCTGCACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAATCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTGCCTACT 

251 ATAAAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

301 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

401 CGAAACTCCC CCCGGGTCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

451 GTCGGCGGCC TGAATAAGGC GGGTACGGGA AAATACAGCA Taggcggcgt 

501 gGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATACGGTAA 

551 CGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGAcgaCG GCAAAACCTA TACGCTGAAG CTCAAATCGG TGCAGATCAA 

651 CGGACAGGCC GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 430; ORF 120.ng>: 

gl20 .pep 

1 MMKTFKNIFS AAILSAALPC AYA ARLPQSA VLHYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PAYYKDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DTVTYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 43 1>: 

ml20.seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGmACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCcAA ATTCGCCGAC 

301 GGCAGCGTAA CTTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

401 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

451 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT 

501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 

551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 

651 CGGCCAGGCA GCCAAACCG 

This corresponds to the amino acid sequence <SEQ ID 432; ORF 120>: 

ml20 .pep 

1 MMKTFKNIFS AAILSAALPC AYA AGLPQSA VLXYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DAVMYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 120 shows 97.3% identity over a 223 aa overlap with a predicted ORF (ORF 120.ng) 
fix)m A^. gonorrhoeae: 

ml20/gl20 

10 20 30 40 50 60 

ml2 0 .pep MMKTFKNIFSAAILSAALPCAYAAGLPQSAVLXYSGSYGIPATMTFERSGNAYKIVSTIK 

IIIIIIIIIIIIIIIIIMIIIII lllllll IIIIIIIIIIIIIMIIIIIIIIIIII 

g 1 2 0 MMKTFKNI FSAAI LS AALPCAYAARLPQS AVLH YSGS YGI PATMTFERSGNAYKI VST I K 

10 20 30 40 50 60 

70 .80 90 100 110 120 

ml20 .pep VPLYNIRFESGGTWGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

llllllllilllllllllllhlhllllllllllllllllllllllllllMIIIIIII 

gl20 VPLYNIRFESGGTWGNTLHPAYYKDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 
70 80 90 100 . 110 120 

130 140 150 160 170 180 

ml20 . pep DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEWKYRVRRGD 

IIMIIIIIIIIMMIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIII 

gl20 DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEWKYRVRRGD 
130 140 150 160 170 180 

190 200 210 220 

ml2 0 . pep DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKP 

hi IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 
gl2 0 DTVTYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 
190 200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 433>: 

al20.seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGCACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

301 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

401 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

451 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT 

501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 

551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 

651 CGGCCAGGCA GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 434; ORF 120.a>: 
al20.pep 

1 MMKTFKNIFS AAILSAALPC AYA AGLPQSA VLHYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DAVMYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP* 

ml20/al20 99.6% identity in 223 aa overlap 

10 20 30 40 50 60 

ml20.pep MMKTFKNIFSAAILSAALPCAYAAGLPQSAVLXYSGSYGIPATMTFERSGNAYKIVSTIK 

I M M M I N I I I I i I I I I I I I I I I I I I t I I I I I I I M I I I ] I I I i I 

al20 MMKTFKNIFSAAILSAALPCAYAAGLPQSAVLHYSGSYGIPATMTFERSGNAYKIVSTIK 



• 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ml20 . pep VPLYNIRFESGGTWGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 
I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al20 VPLYNIRFESGGTWGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ral20 . pep DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEWKYRVRRGD 
I I i I I I I I I M I I i I I I I I I i I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I M I I 
al20 DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEWKYRVRRGD 

130 140 150 160 170 180 

190 200 210 220 

ml20 . pep DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 
I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I i I I I I I 1 I I I I j 
al20 DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

190 200 210 220 



The following partial DNA sequence was identified in gonorrhoeae <SEQ ID 435>: 

9I21 .seq 

1 ATGGAAACAC 

51 GGCGGATGCC 

101 AAGGGCACGC 

151 GATTTGCAGG 

201 GCAAGAACTC 

251 GTCAAAACCT 

301 ACCGTCCGAC 

351 GCCGCTGCTG 

4 01 GCCGCGACCT 

451 CACGAAGCCC 

501 CGGCGGGATT 

551 GCTTCGACAC 

601 cacTGGcagc 

651 catatTGCcg 

701 AACCCcaccc 

751 gaaacctAcc 

801 ttcccgattc 

851 CAGATGCCCG 

901 TTAATGGCGG 

951 CACCGCCGAA 

1001 cgtggttggC 

1051 GCGACCGGCG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 436; ORF 121.ng>: 

gl21 .pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRVVLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 437>: 

ml21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 



AGCTTTACAT CGGCATTATG 
GTGCTGGTAC GGATGGACGG 
CTTTACCCCC TACCCTGACC 
ACACAGGCAC AGACG7UVCTG 
AGCCGCCTGT ACGCGCAAAC 
CGCTCCGTGC GACATTACCG 
ACGCGCCGGA ACACGGTtac 
GCGGAACTGa cgcggatttT 
TGCTGCCGGC GGacaAGGTG 
TGTTCCGCGA TGACAGGGAA 
GCCAACATCA GCGTACTCCC 
AGGGCCGGGC AATATGCTGA 
TGCCTTACGA CAAAAacggt 
cAACTGCTCG gcaggctGCT 
aaAAAGCACG GGgcGCGaac 
ttgacggcgg cgaaaaccga 
accgcgcaaA ccgTttggga 
TCAAATGTAC ATTTGCGGCG 
ATTTGGCAGA ATGTTTCGGC 
CTGAACCTCG ATCCTCAATG 
GGCGTGTTGG ATTAACCGCA 
CATCCAAACC GTGTATTCTG 



TCGGGAACCA GTATGGACGG 
CGGCAAATGG CTGGGCGCGG 
GGTTGCGCCG CAT^TTGCTG 
CACCGCAGCA GGATGTTGTC 
CGCCGCCGAA CTGCTGTGCA 
CCCTCGGCTG CCACGGGCAA 
AGCATACAGC TTGCCGATTT 
TACCGTCggc gacttcCGCA 
CGCCGCTCGT CCCCGCCTTT 
ACACGCGTGG TACTGAACAT 
CCCCGGCGCA CCCGCCTTCG 
TGGAcgcgtg gacgcaggca 
gcAAAGgcgg cacAAGGCAA 
CGCCcaccCG TATTTCTCAC 
TgtttgcccT AAattggctc 
tacgacgtat tgcggacgct 
cgccgtctca CACGCAGCGG 
GCGGCATCCG CAATCCTGTT 
ACACGCGTTT CCCTGCACAG 
GGTGGAGGCG gccgCATTtg 
TTCCCGGTAG TCCGCACAAA 
GGCGCGGGAT ATTATTATTG 
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151 


GATTTGCAGG 


ACACAGGCGC 


201 


GCAAGAACTC 


AGCCGCCTAT 


251 


GTCAAAACCT 


CGCACCGTCC 


301 


ACCGTCCGAC 


ACGCGCCGGA 


351 


GCCGCTGCTG 


GCGxxxxxxx 


401 


xxxxxxxxxx 


xxxxxxxxxx 


451 


xxxxxxxxxx 


xxxxxxxxxx 


501 


xxxxxxxxxx 


xxxxxxxxxx 


551 


xxxxxxxxxx 


xxxxxxxxxx 


601 


xxxxxxCAGC 


TTCCTTACGA 


651 


CATATTGCCG 


CAACTGCTCG 


701 


AACGCCACCC 


TAAAAGCACG 


751 


GAAACCTACC 


TTGACGGCGG 


801 


TTCCCGTTTT 


ACCGCGCAAA 


851 


CAGATGCCCG 


TCAAATGTAC 


901 


TTAATGGCGG 


ATTTGGCAGA 


951 


CACCGCCGAC 


CTGAACCTCG 


1001 


CGTGGTTGGC 


GGCGTGTTGG 


1051 


GCAACCGGCG 


CATCCAAACC 


1101 


A 





AGACGAACTG CACCGCAGCA GGATTTTGTC 
ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 
GACATTACCG CCCTCGGCTG CCACGGGCAA 
ACACGGTTAC AGCATACAGC TTGCCGATTT 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
CAAAAACGGT GCAAAGTCGG CACAAGGCAA 
ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
GGGCGCGAAC TGTTTGCCAT AAATTGGCTC 
CGAAAACCGA TACGACGTAT TGCGGACGCT 
CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 
ATTTGCGACG GCGGCATCCG CAATCCTGTT 
ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
ATCCGCAATG GGTGGAAGCC GCCGnATTTG 
ATTAATCGCA TTCCCGGTAG TCCGCACAAA 
GTGTATTCTG AnCGCGGGAT ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 438; ORF 121>: 

Z11I21 .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

Computer analysis of the amino acid sequences gave the following results: 
Homology with a predicted ORF from K meningitidis menA with menB 
ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (0RF121.ng) 
from A^. gonorrhoeae: 

ml21/gl21 



10 20 30 40 50 60 

ml21.pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I i I I I I I I I I I I I i I I I I : I I I I I I I I I M I I I I I I 1 I I I ] I : I I I I I I I I : I I i 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I I I I I I t I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g 12 1 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYS IQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

inl21 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml2 1 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: : I I i I 1 I I M I : I I I i I I I I I I I I I I I I ] I : I I | | M 

gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml21 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I M : I I I I I I I I I I I I i I I I I I I i I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 
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310 320 330 340 350 360 

ml21 . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I I I I I I I I I I I I I I : I I I I i I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I 
gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 439>: 

al21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 440; ORF 121.a>; 

al21.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21/al21 74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I 1 I I I I I I I I I I I I I I I I i I I I i I I I I t i I I I I I I I I I I I I I M I : I I n I I i i I I I I 
al21 METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 

liflMIMII Illllllll liMIIMIIIIIIIhlhlllMMI 

al21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
al21 AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: I I I i I I M I t : I I I j I I I I I I I I I I I i I I I I I 1 I I I I 

al21 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

nil21 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I 1 I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21 . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I t I I I t I I i I i I I I I I : I I I I I I I I I I I I I I : I I I I : 1) I I I I I i I t I I I I I I ) I I 
al21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21 .pep 
al21 



XAGYYYX 
I I I I I I 
GAGYYYX 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 441>: 

ml21-1.8eq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCT^CATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ JD 442; ORF 121-1>: 

nil21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

ml21-l/gl21 95.6% identity in 366 aa overlap 

10 20 30 40 50 60 
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ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPyPGRLRRQLLDLQDTGADEL 
llt[)lllllllllilllllll:lllllllllllllllllll lll|:llllllll:tll 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml21-l . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
M I I : I I I I I I I I I I I t I I I I I I ) I I I It ) I I I I I I I I I I I I I 1 I I I I I I I I t I I I I I I 
gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l . pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGXANISVLPPDA 

II llllllllltllllllllllllllllllllllll:llll:|llllllilllllll I 
gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l . pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
I I I II I I I I II I I II M I I II II I I I 1 I I I II II I I II II I I I I I I I I I II : I II II II 
gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l . pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
llllllllllltlllllllltllllllllllllll I I I 1 I I I I I I I I I I I I I I I I I I I I 
gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 

III llltii III llllll 1:111111 III II lltlMllllllltllllllllllllll 
gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
llllll 

gl21 GAGYYYX 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 443>: 

al21-1.8eq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ E) 444; ORF 121-l.a>: 

al21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 
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201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21-l/al21-l 96.4% identity in 366 aa overlap 



10 20 30 40 50 60 

ml2 1-1 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

lllllllllllllltllllllilllllllllllllllllllllltlhllllllllltll 

al21-l METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPyPGRLRRKLLDLQDTGADEL 
10 20 30 40 50 60 



70 80 90 100 110 120 

ral21-l . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
llll:|IIIIIIIMIIIIIIIIIIIItllllll)llllllllilll|:||:|lllllll 
al21-l HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml21-l.pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
llll:||llllltlllllllllllllllllllllll|:|lllllillllllllillllll 
al2 1- 1 AERTQI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANI SVLPPDA 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml2 1 - 1 . pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
I I I I I I I I I I I I M I I 1 I I I I I i I I I I I I I I I I I t I I I I 1 I I I I I I I I I I I I I I I I I I t 
al21-l PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230. 240 

250 260 270 280 290 300 

ml21-l . pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMY ICGGGIRNPV 
I I I I I I I I I I I I t I I I i I I I I 1 I I I I I I I I I I I I I I i i I I I I I ) I I I I I I I I I I I I I I I 
a 1 2 1 - 1 GRELFALNWLET YLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGI RNPV 

250 260 270 280 290 300 



310 320 330 340 350 360 

ral21-l . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I : I I I I : I I I I I I I I I I I I I I I I I I I 
a 12 1 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRI PGSPHKATGASKPCIL 

310 320 330 340 350 360 



ral21-l.pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 445>: 

gl22.seq 

1 ATGGCTTTAC TGAGCATCCG CAAGCTGCAC AAACAATACG GCAGCGTAAC 

51 CGCCATCCAA TCCTTAGACT TGGACTTGGA TVAAAGGCGAA GtcatCGTAC 

101 TGCTGGGCCC gTccggctgc ggCAAATCCA CCCTcctgcg ctgcgtcaaC 

151 GGTTTGGAGC CGCACCAagg cgGCAGCATC GTGATGGACG GTgtcgGCGA 

201 ATTCggcAAA GACGTTTCCT GGCAAACCGC CCGGCAAAAa gtcggtatgg 

251 tctttcaaag taacgAactg Tttgcccaca tgaccgtcat cgAaaacatc 

301 ttcttAggcC CGGTAAagga aCAAAAcCgc gaccgtgccg aagcaGAGGC 

351 gCAAGCCGGC AAactGttgg aacgcgTCGG actgctAGAC CGCAAAAACG 

4 01 CCTATCCGCG CGAACTTTCC GGCGGTCAGA AACAGCGCAT CGCCATTGTC 

451 CGCGCCCTGT GCCTGAATCC GGAAGTCATC CTGCTGGACG AAATCACCGC 

501 CGCACTTGAC CCCGAAATGG TGCGCGAAGT CTTGGAAGTG GTTTTGGAAC 

551 TCGCCCGCGA AGGGATGAGT ATGCTCATCG TAACCCACGA AATGGGGTTC 

601 GCACGCAAAG TTGCCGACCG CATCGTCTTT ATGGACAAAG GCGGCATCGT 

651 CGAATCGTCC GACCCCGAAA CCTTTTTTTC CGCACCAAAA AGCGAACGCG 

701 CCCGCCAATT TCTGGCAGGT ATGGACTACT GA 

This corresponds to the amino acid sequence <SEQ ID 446; ORF 122.ng>: 
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gl22 .pep 



1 MALLSIRKLH KQYGSVTAIQ SLDLDLEKGE VIVLLGPSGC GKSTLLRCVN 

51 GLEPHQGGSI VMDGVGEFGK DVSWQTARQK VGMVFQSNEL FAHMTVIENI 

101 FIjGPVKEQNR DRAEAEAQAG KLLERVGLLD RKNAYPRELS GGQKQRIAIV 

151 RALCLNPEVI LLDEITAALD PEMVREVLEV VLELAREGMS MLIVTHEMGF 

201 ARKVADRIVF MDKGGIVESS DPETFFSAPK SERARQFLAG MDY* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 447>: 

ml22 . seq 

1 GTTGTCATGA TTAAAATCCG CAATATCCAT AAGACCTTTG GCGAAAACAC 



51 TATTTTGCGC GGCATCGATT TGGATGTGTG CAAAGGGCAG GTGGTCGTCA 

101 TCCTCGGGcC TTCCGGCTCA GGCAAAACGA CGTTTCTGCG ATGCCTAAAC 

151 GCGTTGGAAA TGCCCGAAGA CGGACAAATC GAGTTCGACA ACGAGCGACC 

201 GCTGAAAATC GATTTTTCTA AAAAACCAAG CAAACACGAT ATTTTGGCAC 

251 TGCGCCGCAA ATCAJcGCATG GTGTTTCAAC AATACAAyCT CTTTCCGCAC 

301 AAAACCGCCT TGGAAAACGT AATGGAAGGA CCGGTTGCCG TACAgGGCAA 

351 GCCTGCCGCC CAAGCGCGCG AAGAGGCTCT GAAACTGCTG GAAAAAGTCG 

401 GCTTGGGCGA CAAAGTGGAT TTGTATCCCT ACCAGCTTTC CGGCGGTCAG 

451 CAGCAGCGCG TCGGCATTGC CCGCGCATTG GCGATTCAGC CTGAACTGAT 

501 GCTGTTTGAC GAACCGACTT CCGCGCTCGA TCCTGAATTG GTGCAAGATG 

551 TTTTGGATmC CATGAAGGAA TTGGCGCAAG AAGGCTGGAC CATGGTTGTC 

601 GTTACGCATG AAATCAAGTT CGCCTTAGAA GTGGCAACCA CCGwCGTCGT 

651 GATGGACrGC GGCGTTATTG TCGAACAAGG CAGCCCGCAA GATTTGTTCG 

701 ACCACCCCAA ACACGAACGG ACGCGGAGAT TTTTAAGCCA AATCCAATCT 

751 ACCAAGATTT GA 



This corresponds to the amino acid sequence <SEQ ID 448; ORF 122>: 

ml22 .pep 

1 WMIKIRNIH KTFGENTILR GIDLDVCKGQ VWIIiGPSGS GKTTFLRCLN 



51 7VLEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSXM VFQQYNLFPH 

101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ 

151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLDXMKE LAQEGWTMW 

201 VTHEIKFALE VATTXWMDX GVIVEQGSPQ DLFDHPKHER TRRFLSQIQS 

251 TKI* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 122 shows 47.2% identity over a 246 aa overlap with a predicted ORF (ORF 122.ng) 
firom N, gonorrhoeae: 

ml22/gl22 



Tnl22 .pep 



10 20 30 40 50 60 

WMIKIRNIHKTFGENTILRGIDLDVCKGQVWILGPSGSGKTTFLRCLNALEMPEDGQI 



gl22 




10 20 30 40 50 



ml22 .pep 



70 80 90 100 110 120 

EFDNERPLKIDFSKKPSKHDILALRRKSXMVPQQYNLFPHKTALENVMEGPVAVOGKPAA 



gl22 





70 80 90 100 110 



ml22 .pep 



130 140 150 160 170 180 

QAREEALKLLEKVGLGDKVDIjYP YQLSGGQQQRVG I ARALAIQ PELMLFDE PTS ALDPEL 



gl22 




120 130 140 150 160 170 



ml22 .pep 



190 200 210 220 230 240 

VQDVLDXMKEIAQEGWTIWVVTHEIKFALEVATTXVVMDXGVIVEQGSPQDLFDHPKHER 



gl22 
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180 190 200 210 220 230 

250 

ml22 . pep TRRFLSQIQSTKIX 

gl22 ARQFLAGMDYX 
240 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 449>: 

al22.seq 

1 GTTGTCATGA TTAAAATCCG CAATATCCAT AAGACCTTCG GCAAAAATAC 

51 CATTTTGCGC GGCATCAATT TGGATGTGTG CAAAGGGCAG GTGGTCGTCA 

101 TCCTCGGGCC TTCCGGCTCA GGCAAAACGA CGTTTCTGCG ATGCCTAAAC 

151 GCGTTGGAAA TGCCCGAAGA CGGACAAATC GAGTTCGACA ACGAGCGACC 

201 GCTGAAAATC GATTTTTCTA AAAAACCAAG CAAACACGAT ATTTTGGCAC 

251 TGCGCCGCAA ATCAGGCATG GTGTTTCAAC AATACAACCT CTTTCCGCAC 

301 AAAACCGCCT TGGAAAACGT GATGGAAGGA CCGGTTGCCG TACAGGGCAA 

351 GCCTGCCGCC CAAGCGCGCG AAGAGGCTCT GAAACTGCTG GAAAAAGTCG 

401 GCTTGGGCGA CAAAGTGGAT TTGTATCCCT ACCAGCTTTC CGGCGGTCAG 

451 CAGCAGCGCG TCGGCATTGC CCGAGCATTG GCGATTCAGC CCGAGCTGAT 

501 GTTGTTTGAC GAACCCACTT CCGCGCTTGA CCCCGAGTTG GTGCAAGACG 

551 TGTTGAACGC CATGAAGGAA TTGGCGCGGG AAGGTTGGAC GATGGTCGTC 

601 GTTACCCACG AAATCAAGTT CGCGCTGGAA GTTGCCACGA CCGTTGTCGT 

651 GATGGACGGC GGCGTTATCG TAGAGCAGGG CAGCCCGAAA GAGTTGTTCG 

701 ACCACCCCAA ACACGAACGG ACGCGGAGAT TTTTAAGCCA AATCCAATCT 

751 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 450; ORF 122.a>: 

al22.pep 

1 WMIKIRNIH KTFGKNTILR GINLDVCKGQ VWILGPSGS GKTTFLRCLN 

51 ALEMPEDGQI EFDKERPLKI DFSKKPSKHD ILALRRKSGM VFQQYNLFPH 

101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ 

151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLNAMKE LAREGWTMVV 

201 VTHEIK FALE VATTWVMDG GVIV EQGSPK ELFDHPKHER TRRFLSQIQS 

251 TKI* 

ml22/al22 96.0% identity in 253 aa overlap 

10 20 30 40 50 60 

inl22.pep WMIKIRNIHKTFGENTILRGIDLDVCKGQWVILGPSGSGKTTFLRCLNALEMPEDGQI 
IIIIIMIIIIIII:|lillll:lllllllllillllllllillllllllMlillllll 
al22 WMIKIRNIHKTFGKNTILRGINLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQI 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml22 . pep EFDNERPLK I DFSKKPSKHDILALRRKSXMVFQQYNLFPHKTALENVMEG PVAVQGKPAA 

I I I I I M I I I I I I I I I M 1 I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I i I I I I I I I I I 
al22 EFDNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 
70 80 90 100 110 120 

130 140 150 160 170 180 

inl22.pep QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 
I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I i I I I I I I I I I I I I I f 
al22 QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 

130 140 150 160 170 ISO 

190 200 210 220 230 240 

ml22 . pep VQDVLDXMKELAQEGWTMVWTHEIKFALEVATTXWMDXGVIVEQGSPQDLFDHPKHER 
Mill: I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M :: I I I I I I t I I 
al22 VQDVLNAMKELAREGWTMWVTHEIKFALEVATTVWMDGGVIVEQGSPKELFDHPKHER 

190 200 210 220 230 240 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 451>: 

gl22-l .seq 

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACCATTTT 

51 GCGCGGCATC GATTTGGATG TGGGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCGGGTAAA ACAACATTTC TGCGCTGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGCGC GGCCGTTACG 

201 CATTGATTTT TCCAAAAAAA CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAGTCCGG AATGGTATTC CAACAATACA ACCTCTTCCC GCATAAAACC 

301 GTGTTGGAAA ACGTGATGGA AGGGCCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTTTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGATAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

451 CGTGTCGGTA TCGCCCGCGC ACTGGCGATT CAGCCTGAAT TGATGCTGTT 

501 TGACGAACCC ACTTCCGCGC TGGACCCCGA GTTGGTGCAA GACGTGTTGG 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCACGCT GGAAGTTGCC ACGAACGTCG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

701 TCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTGCCAAG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ED 452; ORF 122-1. ng>: 

gl22-l.pep 

1 MIKIRNIHKT FGENTILRGI DLDVGKGQW VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNARPLRIDF SKKTSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 VLENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDAMKELA REGWTMWVT 

201 HEIKFTLEVA TNVVVMDGGV IVEQGSPKEL FDHLKHERTR RFLSQIQSAK 

251 I* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 453>: 

jnl22-1.8eq 

1 ATGATTAAAA TCCGCAATAT 

51 GCGCGGCATC GATTTGGATG 

101 GGCCTTCCGG CTCAGGCAAA 

151 GAAATGCCCG AAGACGGACA 

201 AATCGATTTT TCTAAAAAAC 

251 GCATUVTCAGG CATGGTGTTT 

301 GCCTTGGAAA ACGTAATGGA 

351 CGCCCAAGCG CGCGAAGAGG 

401 GCGACAAAGT GGATTTGTAT 

451 CGCGTCGGCA TTGCCCGCGC 

501 TGACGAACCG ACTTCCGCGC 

551 ATACCATGAA GGAATTGGCG 

601 CATGAAATCA AGTTCGCCTT 

651 CGGCGGCGTT ATTGTCGAAC 

701 CCAAACACGA ACGGACGCGG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 454; ORF 122-1>: 

inl22-l.pep 

1 MIKIRNIHKT FGENTILRGI DLDVCKGQVV VILGPSGSGK TTFLRCLNAL 
51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 
101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 
151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDTMKELA QEGWTMVWT 
201 HEIKFALEVA TTWVMDGGV IVEQGSPQDL FDHPKHERTR RFLSQIQSTK 
251 I* 

ml22-l/gl22-l 94.8% identity in 251 aa overlap 

10 20 30 40 50 60 

ml22-l . pep MlKIRNIHKTFGENTILRGIDLDVCKGQVWILGPSGSGKTTFLRCLNALEMPEDGQIEF 
I i I t I I I I I I I I I I I I I I I I I I I I IlillllllllllttllllllllNIIIIIIIIII 
gl22-l MIKIRNIHKTFGENTILRGIDLDVGKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 



CCATAAGACC 
TGTGCAAAGG 
ACGACGTTTC 
AATCGAGTTC 
CAAGCAAACA 
CAACAATACA 
AGGACCGGTT 
CTCTGAAACT 
CCCTACCAGC 
ATTGGCGATT 
TCGATCCTGA 
CAAGAAGGCT 
AGAAGTGGCA 
AAGGCAGCCC 
AGATTTTTAA 



TTTGGCGAAA 
GCAGGTGGTC 
TGCGATGCCT 
GACAACGAGC 
CGATATTTTG 
ACCTCTTTCC 
GCCGTACAGG 
GCTGGAAAAA 
TTTCCGGCGG 
CAGCCTGAAC 
ATTGGTGC7VA 
GGACGATGGT 
ACCACCGTCG 
GCAAGATTTG 
GCCAAATCCA 



ACACTATTTT 
GTCATCCTCG 
AAACGCGTTG 
GACCGCTGAA 
GCACTGCGCC 
GCACAAAACC 
GCAAGCCTGC 
GTCGGCTTGG 
TCAGCAGCAG 
TGATGCTGTT 
GATGTTTTGG 
TGTCGTTACG 
TCGTGATGGA 
TTCGACCACC 
ATCTACCAAG 
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g 122 -1 DNARPLRI DFSKKTSKHDILALRRKSGMVFQQYNLFPHKTVLENVMEGPVAVQGKPAAQA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml22-l.pep REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 

liiiiiiiiiiiiiiiiiniiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

g 122-1 REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml22-l . pep DVLDTMKELAQEGWTMVVVTHEIKFALEVATTVVVMDGGVIVEOGSPQDLFDHPKHERTR 
llll:|||||:Illllllllllllt:IMII:ll]llltllll)lli::|ll) llllll 
g 1 2 2 - 1 DVLDAMKELAREGWTMWVTHEIKFTLEVATN VVVMDGGVI VEQGS PKELFDHLKHERTR 

190 200 210 220 230 240 

250 

ral22-l .pep RFLSQIQSTKIX 
11111111:111 
gl22-l RFLSQIQSAKIX 
250 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 455>: 

al22-l .seq 

1 ATGATTAAAA 

51 GCGCGGCATC 

101 GGCCTTCCGG 

151 GAAATGCCCG 

201 AATCGATTTT 

251 GCAAATCAGG 

301 GCCTTGGATW^ 

351 CGCCCAAGCG 

401 GCGACAAAGT 

451 CGCGTCGGCA 

501 TGACGAACCC 

551 ACGCCATGAA 

601 CACGAAATCA 

651 CGGCGGCGTT 

701 CCAAACACGA 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 456; ORF 122-1. a>: 

al22-l.pGp 

1 MIKIRNIHKT FGKNTILRGI NLDVCKGQVV VILGPSGSGK TTFLRCLNAL 
51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 
101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 
151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLNAMKELA REGWTMWVT 
201 HEIKFALEVA TTVWMDGGV IVEQGSPKEL FDHPKHERTR RFLSQIQSTK 
251 I* 

aX22-l/inl22-l 97.2% identity in 251 aa overlap 

10 20 30 40 50 60 

MIKIRNIHKTFGKNTILRGINLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 
llllllll!lll:lllllll:|||IMIMIIIIIIItllllllllltllllllllllll 
MIKIRNIHKTFGENTILRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 
10 20 30 40 50 60 

70 80 90 100 110 120 

DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 
llllllltllllllltllillllllllllllilllMIIIIIIIIMIIIIilllMIII 
DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 
70 80 90 100 110 120 

130 140 150 160 170 180 

REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
MMIIMIIIIIIIIIIItlllllllllllllllllllllllllllllMlltllllll 
REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 



TCCGCAATAT CCATAAGACC 
AATTTGGATG TGTGCAAAGG 
CTCAGGCAAA ACGACGTTTC 
AAGACGGACA AATCGAGTTC 
TCTAAAAAAC CAAGCAAACA 
CATGGTGTTT CAACAATACA 
ACGTGATGGA AGGACCGGTT 
CGCGAAGAGG CTCTGAAACT 
GGATTTGTAT CCCTACCAGC 
TTGCCCGAGC ATTGGCGATT 
ACTTCCGCGC TTGACCCCGA 
GGAATTGGCG CGGGAAGGTT 
AGTTCGCGCT GGAAGTTGCC 
ATCGTAGAGC AGGGCAGCCC 
ACGGACGCGG AGATTTTTAA 



TTCGGCAAAA ATACCATTTT 
GCAGGTGGTC GTCATCCTCG 
TGCGATGCCT AAACGCGTTG 
GACAACGAGC GACCGCTGAA 
CGATATTTTG GCACTGCGCC 
ACCTCTTTCC GCACAAAACC 
GCCGTACAGG GCAAGCCTGC 
GCTGGAAAAA GTCGGCTTGG 
TTTCCGGCGG TCAGCAGCAG 
CAGCCCGAGC TGATGTTGTT 
GTTGGTGCAA GACGTGTTGA 
GGACGATGGT CGTCGTTACC 
ACGACCGTTG TCGTGATGGA 
GAAAGAGTTG TTCGACCACC 
GCCAAATCCA ATCTACCAAG 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 457>: 

gl25 . seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGGT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TCGCCCCCTT GGGCTGGCAG CGCGGTCTGG CGGCCCTGCT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GTGTGCGCCT GTCGTTCGGC AAATGCGGTT 

251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGTCGGCGC AacggTCAGC TCCGCTTTGG GCTU^GTGTT 

351 GTGGGACggc gaATCCTTTG TCTGGTGGGC ATTGGCAT^C GGCGCACTGA 

4 01 TCGTGCTGTG GCTGGTTTTC GGCGCACGCA GAACGGGCGG GCTGAAAACC 

451 GTTTCGATGC TGCTGATGCT GCTTGCCGTG TTGTGGTTGA GCGTCGAAGT 

501 GTTCGCTTCG TCCGGCACAA ACGCCGCGCC CGCCGTTTCA GACGGCATGA 

551 CCTTCGGAAC GGCAGTCGAA CTGTCCGCCG TCATGCCGCT TTCCTGGCTG 

601 CCGCTGGCCG CCGACTACAC GCGCCAAGCA CGCCGCCCGT TTGCGGCAAC 

651 CCTGACGGCA ACGCTCGCCT ATACGCTGAC GGGCTGCTGG ATGTATGCCT 

701 TGGGTTTGGC GGCGGCTCTG TTTACCGGAG AAACCGACGT GGCGAAAATC 

751 CTGTTGGGCX; CGGGCTTGGG CATAACGGGC ATTCTGGCAG TCGTCCTCTC 

801 CACCGTTACC ACAACGTTTC TCGATACCTA TTCCGCCGGC GCGAGTGCGA 

851 ACAACATTTC CGCGCGTTTT GCGGAAATAC CCGTCGCTGT CGGCGTTACC 

901 CTGatccgca ccgtgcttgc cgtcatgctg cccgttaccg aatataaaaa 

951 cttcctgctg cttatccgct cggtatttgg gccgatggcg ggtggttttg 

1001 attgccgaCT TTTttgtCtt AAAACGGCGT GA 

This corresponds to the amino acid sequence <SEQ ID 458; ORF 125.ng>: 



The following partial DNA sequence 



ml25 .seq 






1 


ATGTCGGGCA 


ATGCCTCCTC 


51 


TTGGTTCGGC 


GCGGCGGTAT 


101 


TTGCGCCTTT 


GGGCTGGCAG 


151 


GCCGTCGGCG 


GCGCGCTGTT 


201 


CGGACGCAGC 


TCGATGGAAA 


251 


CAGTGCTGTT 


TTCCGTGGCG 


301 


GTGATGATTT 


ACGCCGGCGC 


351 


GTGGGACGGC 


GAATCTTTTG 


401 


TTGTGCTGTG 


GCTGGTTTTC 


451 


GTTTCGATGC 


TGCTGATGCT 


501 


CTTTTCCACG 


GCAGGCAGCA 


551 


TCGGAACGGC 


AGTCGAGCTG 


601 


CTTGCCGCCG 


ACTACACGCG 


651 


GACGGCAACG 


CTCGCCTACA 


701 


GTTTGGCAGC 


GGCGTTGTTC 


751 


CTGGGCGCAr 


GTTTGgGTGC 


601 


CGTTACCACA 


ACGTTTCTCG 



identified in A^. meningitidis <SEQ ID 459>: 

TCCTTCATCT TCCTCCGCCA TCGGGCTGAT 
CGATTGCCGA AATCAGCACG GGTACGCTGC 
CGCGGTCTGG CGGCTCTACT TTTGGGTCAT 
TTTTGCGGCG GCGTATATCG GCGCACTGAC 
GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 
AATATGCTGC AACTGGCCGG CTGGACGGCG 
AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 
TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 
GGCGCACGCA AAACAGGCGG GCTGAAAACC 
GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 
CCGCCGCACA GGTTTCAGAC GGCATQAGTT 
TCCGCCGTGA TGCCGCTTTC CTGGCTGCCG 
CCACGCGCGC CGCCCGTTTG CGGCAACCCT 
CGCTGACC6G CTGCTGGATG TATGCCTTGG 
ACCGGAGAAA CCGACGTGGC AAAAATCCTG 
GGCAGGCATT TTGGCGGTCG TCCTCTCCAC 
ATGCCTATTC CGCCGGCGCG AGTGCGAACA 
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851 ACATTTCCGC GCGTTTTGCG GAAACACCCG TCGCTGTCrG CGTTACCCTG 

901 ATCGGCACGG TACTTGCCGT CATGCTGCCC GTTACCGAAT ATGAAAACTT 

951 CCTGCTGCTT ATCGGCTCGG TATTTGCGCC GATGGCGGgC GGTTTTGATT 

1001 GCCGACTTTT TCGTCTTGAA ACGGCGTGA 

This corresponds to the amino acid sequence <SEQ ID 460; ORF 125>: 

ml25 .pep 

1 MSGNASSPSS SSAIGLIWFG AAVSIAEIST GTLLAPLGWQ RGLAALLLGH 

51 AVGGALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA 

101 VMIYAGATVS SALGKVLWDG ES FVWWALAN GALIVLWLV F GARKTGGLKT 

151 VSML LMLIiAV LWLSAEVF ST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP 

201 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGETDVAKIL 

251 LGAXLGAAGI LAWLSTVTT TFLDAYSAGA SANNI5ARF A ETPVAVXVTL 

301 IGTVLAVMLP VTEYENFLLL. IGSVFAPMAG GFDCRLFRLE TA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 125 shows 92.1% identity over a 343 aa overlap with a predicted ORF (ORF 125.ng) 
from K gonorrhoeae: 

ml25/gl25 

10 20 30 40 50 60 

MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

llllllllllhlllhlMIIIIIIIIIIIIIIIMIIIIIMIMIIIIIIIIIIIII 

MSGNASSPSSSAAIGLVWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 
10 20 30 40 50 60 

70 80 90 100 110 120 

AYIGALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSAIiGKVLWDG 

I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I 
AYIGALTGRSSMESVRLSFGKCGSVLFSVANMLQLAGWTAVMIYVGATVSSALGKVLWDG 

70 80 90 100 110 120 

130 140 150 160 . 170 179 

ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQ-VS 
llllllllllllllllllllllhlllllllllllllllllll|:|||:::|::|| || 
ESFVWWALANGALIVLWLVFGARRTGGLKTVSMLLMLIiAVLWLSVEVFASSGTNAAPAVS 
130 140 150 160 170 180 

180 190 200 210 220 230 239 

DGMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAAL 

llhillllllllllllllllllllllhlllllllllllllllllMMIIIIIIIIII 

DGMTFGTAVELSAVMPLSWLPLAADYTRQARRPFAATLTATLAYTLTGCWMYALGLAAAL 
190 200 210 220 230 240 

240 250 260 270 280 290 299 

FTGETDVAKI LLGAXLGAAG I LAWLS WTTTFLDAYSAGAS ANN I S/^FAETPVAVXVT 

iiiiiiiiiiiiii II HiiiiiiiiiiiiiihiiiiiiiiiiMiiii nil II 

FTGETDVAKILLGAGLGITGILAWLSTVTTTFLDTYSAGASANNISARFAEIPVAVGVT 
250 260 270 280 290 300 

300 310 320 330 340 

LIGTVIiAVMLPVTEYENFLLLIGSVFAPMAGGFDCRLFRLETAX 

II lllllllllllhllllll Mhlllllllllll hill 

LIRTVIiAVMLPVTEYKNFLLLIRSVFGPMAGGFDCRLFCLKTAX 
310 320 330 340 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 461>: 

al25 . seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGAT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACACTGC 

101 TTGCGCCTTT GGGCTGGCAG CGCGGTCTGG CGGCTCTGCT TTTGGGTCAT 



nil25 .pep 
gl25 

ml25 .pep 
gl25 

ml25 .pep 
gl25 

ml25 .pep 
gl25 

ml25 .pep 

gi25 

ral25 .pep 

gi25 
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151 


GCCGTCGGCG 


201 


CGGACGCAGC 


251 


CAGTGCTGTT 


301 


GTGATGATTT 


351 


GTGGGACGGC 


401 


TTGTGCTGTG 


451 


GTTTCGATGC 


501 


CTTTTCCACG 


551 


TCGGAACGGC 


601 


CTGGCCGCCG 


651 


GACGGCAACG 


701 


GTTTGGCAGC 


751 


CTGGGCGCAG 


801 


CGTTACCACC 


851 


ATATTTCCGC 


901 


GTCGGCACAC 


951 


CCTGCTGCTT 


1001 


GCCGACTTTT 



GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 
TCGATGGAAA GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 
TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 
ACGCCGGCGC AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 
GAATCTTTTG TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 
GCTGGTTTTC GGCGCACGCA AAACAGGCGG GCTGAAAACC 
TGCTGATGCT GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 
GCAGGCAGCA CCGCCGCACA GGTTTCAGAC GGCATGAGTT 
AGTCGAGCTG TCCGCCGTGA TGCCGCTTTC TTGGCTGCCG 
ACTACACGCG CCACGCGCGC CGCCCGTTTG CGGCAACCCT 
CTCGCCTACA CGCTGACCGG CTGCTGGATG TATGCCTTGG 
GGCGTTGTTC ACCGGAGAAA CCGACGTGGC AAAAATCCTG 
GTTTGGGTGC GGCAGGCATT TTGGCGGTCG TCCTGTCGAC 
ACTTTTCTCG ATGCCTACTC CGCCGGCGTA AGTGCCAACA 
CAAACTTTCG GAAATACCCA TCGCCGTTGC CGTCGCCGTT 
TGCTTGCCGT CCTCCTGCCC GTTACCGAAT ATGATWVCTT 
ATCGGCTCGG TATTTGCGCC GATGGCG.GC GGTTTTGATT 



This corresponds to the amino acid sequence <SEQ ID 462; ORF 125. a>: 

al25 .pep 

1 MSGNASSPSS SAAIGLIWFG AAVSIAEIST GTLLAPLGWQ RGLAALLLGH 

51 AVGG ALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA 

101 VMIYAGATVS SALGKVLWDG ES FVWWALAN GALIVLWLV F GARKTGGLKT 

151 VSMLLMLLAV LWLSAEV FST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP 

201 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGETDVAKIL 

251 LGAGLGAAGI LAWL STVTT TFLDAYSAGV SANNISAKLS E IPIAVAVAV 

301 VGTLLAVLLP VTEYENFLLL IGSVFAPMAX GFDCRLFRLE TA* 



ml25/al25 95.6% identity in 342 aa overlap 

10 20 30 40 50 60 

ml25.pep MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 
I I I I I I I I I I I : I I I I I I I I I I I I I I I i I I I i I I I I I i I I I I M I I I I I I I I I I I I I I I I 
al25 MSGNASSPSSSAAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml25 . pep AY IGALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVS SALGKVLWDG 

i 1 1 1 1 1 1 ) I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i ) 1 1 1 1 1 1 1 1 1 1 

a 1 2 5 AY IGALTGRS SMES VRLS FGKRG S VLFS VANMLQLAGWT AVMI YAGAT VS SALGKVLWDG 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 2 5 . pep ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFST AGSTAAQVSD 

M I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
al25 ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQVSD 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml25 . pep GMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTAT LAYTLTGCWM YALGLAAALF 

I I I I I I I M I I I I I I I I M I I I i I I I I I I I I I I I I I I I ] I I I I I I I I I I t I I I I I M I I I 
al25 GMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAALF 

190 200 210 220 230 240 



250 260 270 280 290 300 

nil25 . pep TGETDVAKILLGAXLGAAGILAWLSTVTTTFLDAYSAGASANNISARFAETPVAVXVTL 
I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I : I I I I I I I ::: t I : I I I : : 
al25 TGETDVAKILLGAGLGAAGILAWLSTVTTTFLDAYSAGVSANNISAKLSEIPIAVAVAV 
250 260 270 280 290 300 



310 320 330 340 

ml25.pep IGTVLAVMLPVTEYENFLLLIGSVFAPMAGGFDCRLFRLETAX 
: I I : I I I : I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I 
al25 VGTLLAVLLPVTEYENFLLLIGSVFAPMAXGFDCRLFRLETAX 
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310 



320 



330 



340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 463>: 

gl26 . seq 



1 AtgccgtcTG AAaccCcaaa ggcACGCCGC CGGCTTTCAG ACGGCATCGC 

51 GTCCGACAAC CATACCAAAG AATCCATCAT GCTCACCctg tacggcGAAA 

101 CTTTCCCTTC GCGGCTGCTg ctcggcacgG cggcctacCC GACCCCTGAA 

151 ATCCTCAAAC AATCCGTCCG AACCGCCCGG CCCGCGATGA ttaccGTCTC 

201 GCTGCGCCGC ACGGGATGCG GCGGCGAGGC GCACGGTCAG GGGTTTTGGT 

251 CGCTGCTTCA AQAAACCGGC GTTCCCGTCC TGCCGAACAC GGCAGGCTGC 

3 01 CAAAGCGTGC AGGAAGCGGT. AACGACGGCG CAAATGGCGC GCGAAGTGTT 
351 TGAAACCGAT TGGATAAAAT TGGAACTCAT CGGCGACGAC GACACCTTGC 

4 01 AGCCGGACGT GTTCCAACTC GTCGAAGCGG CGGAAATCCT GATTAAAGAC 
451 GGCTTCAAAG TGCTGCCTTA TTGCACCGAA GACCTGATTG CCTGCCGCCG 
501 CCTGCTCGAT GCGGGCTGTC AGGCGTTGAT GCCGTGGGCG GCTCCCATCG 
551 GCACGGGTTT GGGGGCGGTT CACGCCTATG CGCTCAAAAT CCTGCGCGAA 
601 CGCCTGCCCG ACACGCCGCT GATTATCGAC GCGGGCTTGG GTTTGCCTTC 
651 CCAAGCGGCA CAAGTGATGG AATGGG6TTT TGACGGCGTA TTGTTAAACA 
701 CCGCCGTTTC CCGCAGCGGC GACCCCGTCA ACATGGCGCG CGCCTTCGCA 
751 CTCGCCGTCG AATCCGGACG GCTGGCATTT GAAGCCGGGC CGGTCGAAGC 
801 GCGAACCAAA GCCCAAGCCA GCACGCCGAC AGTCGGACAA CCGTTTTGGC 
851 ATTCGGCGGA ATATTGA 



This corresponds to the amino acid sequence <SEQ ID 464; ORF 126.ng>: 

gl26 .pep 



1 MPSETPKARR RLSDGIASDN HTKESIMLTL YGETFPSRLL LGTAAYPTPE 

51 ILKQSVRTAR PAMITVSLRR TGCGGEAHGQ GFWSLLQETG VPVLPNTAGC 

101 QSVQEAVTTA QMAREVFETD WIKLELIGDD DTLQPDVFQL VEAAEILIKD 

151 GFKVLPYCTE DLIACRRLLD AGCQALMPWA APIGTGLGAV HAYALKILRE 

201 RLPDTPLIID AGLGLPSQAA QVMEWGFDGV LLNTAVSRSG DPVNMARAFA 

251 LAVESGRIiAF EAGPVEARTK AQASTPTVGQ PFWHSAEY* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 465>: 

ml26 . seq (partial) 



1 . . CACTATACAA AGGAACCCAT TATGCTCACC CTATACGGCG AAACTTTCCC 

51 CTCGCGGCTG CTGCTCGGCA CGGCTGCCTA CCCGACCCCC GAAATCCTCA 

101 AACAATCCAT CCAAACCGCC CAGCCTGCGA TGATTACCGT CTCGCTGCGC 

151 CGCGCGGGAA GCGGCGGCGA GGCGCACGGT CAGGGGTTTT GGTCGCTGCT 

201 TCAAGAAACC GGCGTTCCCG TCCTGCCGAA CACGGCAGGC TGCCAAAGCG 

251 TGCAGGAAGC GGTAACGACG GCGCAAATGG CGCGCGAAGT GTTTGAAACC 

301 GATTGGATAA AATTGGAACT CATCGGAGAT GACGACACCT TGCAGCCGGA 

3 51 TGTGTTCCAG CTTGTCGAAG CGGCGGAAAT CCTGATTAAA GACGGCTTCA 

401 AAGTGCTGCC TTATTGCACC GAAGACCTGA TTGCCTGCCG CCGCCTGCTC 

451 GACGCGGGCT GTCAGGCGTT GATGCCGTGG GCGGCCCCGA TCGGCACGGG 

501 TTTGGGCGCG GTTCACGCCT ACGCGTTGAA CGTCCTGCGC GAACGCCTGC 

551 CCGACACGCC GCTGATTATC GACGCGGGCT TGGGTTTGCC CTCACAGGCG 

601 GCACAAGTGA TGGAATGGGG CTTTGACGGC GTGCTTTTGA ATACTGCCGT 

651 TTCCCGCAGC GGCGATCCGG TCAATATGGC ACGCGCCTTC GCACTCGCCG 

701 TCGAATCCGG ACGGCTGGCA TTTGAAGCCG GACCGGTCGA AGCACGCGAC 

751 AAAGCGCAAG CCAGCACGCC GACAGTCGGA CAACCGTTTT GGCATTCGGC 

801 GGAATATTGA 



This corresponds to the amino acid sequence <SEQ ID 466; ORF 126>: 

ml26 .pep (partial) 

1 ..HYTKEPIMLT LYGETFPSRL LLGTAAYPTP EILKQSIQTA QPAMITVSLR 
51 RAGSGGEAHG QGFWSLLQET GVPVL.PNTAG CQSVQEAVTT AQMAREVFET 
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101 DWIKLELIGD DDTLQPDVFQ LVEAAEILIK DGFKVLPYCT EDLIACRRLL 

151 DAGCQALMPW AAPIGTGLGA VHAYALNVLR ERLPDTPLII DAGLGLPSQA 

201 AQVMEWGFDG VLLNTAVSRS GDPVNMARAF ALAVESGRLA FEAGPVEARD 

251 KAQASTPTVG QPFWHSAEY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 126 shows 95.9% identity over a 269 aa overlap with a predicted ORF (ORF 126.ng) 
from A^. gonorrhoeae: 

nil26/gl26 

10 20 30 40 

ml26.pep ■ HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQ 

::||| llllllllllllllllllllll||||||||::||: 
gl26 MPSETPKARRRLSDGIASDNHTKESIMLTLYGETFPSRLLLGT7VAYPTPEILKQSVRTAR 
10 20 30 40 50 60 

50 60 70 80 90 100 

ral26 .pep PAMITVSLRRAGSGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 

Mlllllllhl II I II II I II II III II III II II I II II I II III 1 1 II II Mill I 

gl26 PAMITVSLRRTGCGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 
70 80 90 100 110 120 

110 120 130 140 150 160 

ml26.pep WI KLELI GDDDTLQPDVFQLVEA7VE I LI KDGFKVLP YCTEDL I ACRRLLDAGCQALMPWA 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

gl26 WIKLELIGDDDTLQPDVFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQAliMPWA. 

130 140 150 160 170 180 

170 180 190 200 210 220 

ml26 . pep APIGTGLGAVHAYALNVLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 
IIIIIMIIIIIII|::|IIIIIIIIIIIIIIMI||||||||IIIIIIII||||||||| 
gl26 APIGTGIiGAVHAYALKILRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 
190 200 210 220 230 240 

230 240 250 260 270 

ml 2 6 . pep DPVNMARAFALAVESGRLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 

IIIIIIIIIIIIMIIIIIIIIIIIIII IIIMMIIIIIIIIIIIII 

gl 2 6 DPVNMT^RAFALAVESGRLAFEAGPVEARTKAQASTPTVGQPFWHSAEYX 
250 260 270 280 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 467>: 

al26. seq 

1 TTGTTAATCC ACTATACAAA GGAACCCATT ATGCTCACCC TGTACAGCGA 

51 AACTTTCCCT TCGCGGCTGC TGCTCGGCAC AGCCGCCTAC CCGACCCCTG 

101 AAATCCTCAA ACAATCCGTC CGAACCGCCC GGCCCGCGAT GATTACCGTC 

151 TCGCTGCGCC GCGCGGGATG CGGCGGCGAG GCGCACGGTC AGGGGTTTTG 

201 GTCGCTGCTT CAAGAAACCG GCGTTCCCGT CCTGCCGAAC ACGGCAGGCT 

251 GCCAAAGCGT GCAGGAAGCG GTAACGACGG CGCAAATGGC GCGCGAAGTG 

301 TTTGAAACCG ATTGGATTAA ACTCGAACTC ATCGGCGACG ACGACACCTT 

351 GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC GGCGGAAATC CTGATTAAAG 

401 ACGGCTTCAA AGTGCTGCCT TATTGCACCG AAGACCTGAT TGCCTGCCGC 

451 CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG ATGCCGTGGG CGGCCCCGAT 

501 CGGCACGGGT TTGGGCGCGG TTCACGCCTA CGCGTTGAAC GTCCTGCGCG 

551 AACGCCTGCC CGACACGCCG CTGATTATCG ACGCGGGCTT GGGTTTGCCC 

601 TCACAGGCGG CACAAGTGAT GGAATGGGGC TTTGACGGCG TGCTTTTGAA 

651 TACTGCCGTT TCCCGCAGCG GCGATCCGGT CAATATGGCA CGCGCCTTCG 

701 CACTCGCCGT CGAATCCGGA CGGCTGGCAT TTGAAGCCGG ACCGGTCGAA 

751 GCACGCGACA AAGCGCAAGC CAGCACGCCG ACAGTCGGAC AACCGTTTTG 

801 GCATTCGGCG GAATATTGA 
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This corresponds to the amino acid sequence <SEQ ID 468; ORF 126.a>: 

al26.pep 

1 LLIHYTKEPI MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV 

51 SLRRAGCGGE AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV 

101 FETDWIKLEL IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR 

151 RLLDAGCQAL MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP 

201 SQAAQVMEWG FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE 

251 ARDKAQASTP TVGQPFWHS A EY * 

ml26/al26 98.1% identity in 269 aa overlap 

10 20 30 40 50 

HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGE 
i I I I I I I I I I I I : I I.I I I 1 I i I I I I I I I I I I I I I I I :: I I : I I i It I I I I I I I III 
LLIHYTKEPIMLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRAGCGGE 
10 20 30 40 50 60 

60 70 80 90 100 110 

AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPD 
I II I I I I I I II 11 I II I II I I I II I I I I I I I I I II I I I I I I I I I I M I I M II I I I I II I 
AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPD 
70 80 90 100 110 120 

120 130 140 150 160 170 

VFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN 
I II I I M II I I I I II I I I I II I I I I I I I I I I It I I I I I I I I II I I I I I I I I I I I II I I I t 
VFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN 
130 140 150 160 170 180 

180 190 200 210 220 230 

VLRERLPDTPLIIDAGLGLPSQT^QVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESG 
I I I I I I I I I II I I I II t I I I I II I I I I I I I I t I I I I i I I I I I I I I I I I I I I i I I I I I II I 
VLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLKTAVSRSGDPVNMARAFALAVESG 
190 200 210 220 230 240 

240 250 260 270 

RLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 
I I I I I I I I I I I I I I II I I I II I i II I I I I I t I I 
RLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 
250 260 270 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 469>: 

gl26-l.seq 

1 ATGCTCACCC TGTACGGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 GGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCACGGGATG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAC GTGTTCCAAC TCGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ATGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCTCCCAT CGGCACGGGT TTGGGGGCGG TTCACGCCTA 

501 TGCGCTCAAA ATCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCT TCCCAAGCGG CACAAGTGAT GGAATGGGGT 

601 TTTGACGGCG TATTGTTAAA CACCGCCGTT TCCCGCAGCG GCGACCCCGT 

651 CAACATGGCG CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG GCCGGTCGAA GCGCGAACCA AAGCCCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 470; ORF 126-1. ng>: 

gl26-l.pQp 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRTGCGGE 



ml26 .pep 
al26 

ml26 .pep 
al26 

ml26,pep 
al26 

ml26.pep 
al26 

ml26 .pep 
al26 
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51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALK ILRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARTKAQASTP 

251 TVGQPFWHSA EY* 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 471>: 

inl26-l .seq 

1 ATGCTCACCC TATACGGCGA AACTTTCCCC TCGCGGCTGC TGCTCGGCAC 

51 GGCTGCCTAC CCGACCCCCG AAATCCTCAA ACAATCCATC CAAACCGCCC 

101 AGCCTGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGAAG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGAT7VAA ATTGGAACTC 

301 ATCGGAGATG ACGACACCTT GCAGCCGGAT GTGTTCCAGC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 



This corresponds to the amino acid sequence <SEQ ID 5; ORF 126-1>: 

ial26-l.pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSI QTAQPAMITV SLRRAGSGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

251 TVGQPFWHSA EY* 



ml26-l/gl26-l 96.9% identity in 262 aa overlap 

10 20 30 40 50 60 

ial26-l . pep MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGETVHGQGFWSLL 
IIIIIIIMMMIIIIIIIIIillllll::||:||||||lill:| I I I I I I I I I I I I I 
gl26-l MLTLYGETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRTGCGGEAHGQGFWSLL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 2 6-1 . pep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDW I KLEL IGDDDTLQPD VFQLVEAAEI 
I I I I I I 1 I I I I I i I I I I I I I I I I i I I I I I I I I I t I 1 I I I I I I I I I I I I I I I I I i I I t I I I 
gl2 6-1 QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 2 6-1 . pep LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I :: I j I I I I I I I 
gl26-l LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALKILRERLPDTP 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml26-l . pep LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
I I I I I I I I I I I I I I I I I I ) I I t I I I I I I I I I I i I I I I I I I I I I I t I I I I I I I I I I I I I I I 
gl2 6-1 LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

190 200 210 220 230 240 



250 260 
ml26-l.pep ARDKAQASTPTVGQPFWHSAEYX 
II I I I I I I I I i I I I I I I I I I I I 
gl26-l ARTKAQASTPTVGQPFWHSAEYX 

250 260 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 473>: 

al26-l.seq 

1 ATGCTCACCC TGTACAGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 
51 AGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 
101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGATG CGGCGGCGAG 
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151 GCGCACGGTC AGGGGTTTTG 

201 CCTGCCGAAC ACGGCAGGCT 

251 CGCAAATGGC GCGCGAAGTG 

301 ATCGGCGACG ACGACACCTT 

351 GGCGGAAATC CTGATTAAAG 

401 AAGACCTGAT TGCCTGCCGC 

451 ATGCCGTGGG CGGCCCCGAT 

501 CGCGTTGAAC GTCCTGCGCG 

551 ACGCGGGCTT GGGTTTGCCC 

601 TTTGACGGCG TGCTTTTGAA 

651 CAATATGGCA CGCGCCTTCG 

701 TTGAAGCCGG ACCGGTCGAA 

151 ACAGTCGGAC AACCGTTTTG 



GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 
GCCAAAGCGT GCAGGAAGCG GTAACGACGG 
TTTGAAACCG ATTGGATTAA ACTCGAACTC 
GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC 
ACGGCTTCAA AGTGCTGCCT TATTGCACCG 
CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 
CGGCACGGGT TTGGGCGCGG TTCACGCCTA 
AACGCCTGCC CGACACGCCG CTGATTATCG 
TCACAGGCGG CACAAGTGAT GGAATGGGGC 
TACTGCCGTT TCCCGCAGCG GCGATCCGGT 
CACTCGCCGT CGAATCCGGA CGGCTGGCAT 
GCACGCGACA AAGCGCAAGC CAGCACGCCG 
GCATTCGGCG GAATATTGA 



This corresponds to the amino acid sequence <SEQ ID 474; ORF 126-l.a>: 

al26-l .pep 

1 MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRAGCGGE 

51 AHGQGrWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

251 TVGQPFWHSA EY* 



al26-l/ial26-l 98.1% identity in 262 aa overlap 

10 20 30 40 50 60 

al26-l.pep MLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRAGCGGEAHGQGFWSLL 
ll)l|:||llllil)lllllllllllll|::M:|lltllllllll IIIIINIItDI 
ml26-l MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGEAHGQGFWSLL 

10 20 30 40 50 60 



70 80 90 100 110 120 

al26-l.pep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I i I I I I t I I I I I I I I I I I i I I I I I I 
ml2 6- 1 QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 

70 80 90 100 110 120 



130 140 150 160 170 180 

al26-l.pep LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 
lltllllllllllllllllllllllllllllllllllillilllllllllMMIIIIII 
ral26-l LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 

130 140 150 160 170 180 



190 200 210 220 230 240 

al26-l .pep LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
I I I I I I I I I I I I I I I I I I I I I I I 11 i I t I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I 
ml2 6-1 LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

190 200 210 220 230 240 



250 260 
al26-l.pep ARDKAQASTPTVGQPFWHSAEYX 
I I I I I I I I I I I M I I I I ) I [ I I I 
ml26-l ARDKAQASTPTVGQPFWHSAEYX 

250 260 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 475>: 

gl27 . seq 

1 ATGGAAATAT GGAATATGTT GAACACTTGG CCCGATGCCG TCCCGATACG 

51 CGCGGAGGCG GCCGAATCCG TGGCGGCGGT CGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAGAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATT CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACAAAAGAA 

301 CTGATTATGT GTCTGTCGGG CAGTATTTTA aggtctGCCA CCCAGCAATA 

351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

401 ACATCAATCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 

451 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 

501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA 
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551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 TCAGCGGTAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

701 CCGCCAGGCC GCGCGTTACC CGCGTACCGT ACGACGACAA GGCATACCGC 

751 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCATC 

851 CCGCCGgctc cgAAACACTT TAA 



This corresponds to the amino acid sequence <SEQ ID 476; ORF 127.ng>: 

gl27 .pep 



1 MEIWNMLNTW PDAVPIRAEA AES VAAVAAL LLARALLLNI HFRRHPDFGI 

51 ESKRRFLVAS R NITLLLVLF SIiAFIWSA QI Q TLALSMFAV AAAVWAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRY LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 477>: 



1 ATGGAAATAT GGAATATGTT GGACACTTGG CTCGGTGCCG TCCCGATACG 

51 TGCGGAGGCG GTCGAATCCG TGGCGGCGGT TGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC GGCATCCGGA TTTCGGCATC 

151 GAAAGC7VAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATC CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACGAAGGAA 

301 CTGATTATGT GTCTGTCGGG CAGTATTTTA AGGTCTGCCA CCCAGCAATA 

351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTGG 

401 ACATCAACCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 

451 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 

501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA 

551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 CCAACGGsAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

701 CCGCCAGACC GCGCGTTACC CGCGTGCCGT ACGATGACAA GGCATACCGC 

751 ATCATCGTCC GCTTCGCTTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCACC 

851 CCGCCGGCTC CGAAACACTT TAA 



This corresponds to the amino acid sequence <SEQ ID 478; ORF 127>: 

ml27 .pep 

1 MEIWNMLDTW LGAVPIRAEA VESVAAVAAL LIiARALLLNI HFKRHPDFGI 



51 ESKRRFLVAS RNITLLLVLF SLAFIWSA QI QTLALSMFAV AAAVWAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRX LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 127 shows 97.9% identity over a 290 aa overlap with a predicted ORF (ORF 127.ng) 
from N. gonorrhoeae: 

ml27/gl27 



ml27 . seq 



ml27 .pep 



10 20 30 40 50 60 

ME I WNMLDTWLGAVP I RAEAVESVAAVAALLLARALLLNI HFKRHPDFGI ES KRRFLVAS 



gl27 




10 20 30 40 50 60 



ml27 .pep 



70 80 90 100 110 120 

RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVWATKELIMCLSGSILRSATQQYSVG 



gl27 
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70 80 90 100 110 120 

130 140 150 160 170 180 

ml27 . pep DYIEINGLRGRWDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 
gl 2 7 DY I E INGIiRGRWDINLLNTLMMQVGPNPLVGQLAGTTVS FPNSLLLSHPVRRDNI LGDY 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml27 . pep VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRXLENVQAEKLFITPAARPRVT 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii iiiiiiiiiiiiiiiiiiii 

gl27 VIHTVEIPVPIHUDSDEAVCRLKAVLEPLCAPYIPAIQRYLENVQAEKLFITPAARPRVT 
190 200 210 220 230 240 

250 260 270 280 290 

ml2 7 . pep RVPYDDKAYRI IVRFASPVSKRLEIQQAVMDEFLRVQYRLLNHPAGSETLX 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIII 

gl27 RVPYDDKAYRIIVRFASPVSKRLEICXJAVMDEFLRVQYRLLNHPAGSETLX 
250 260 270 280 290 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 479>: 

al27.seq 

1 ATGGAAATAT GGAATATGTT GGACACTTGG CTCGGTGCCG TCCCGATACG 

51 TGCGGAGGCG GTCGAATCCG TGGCGGTGGT CGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATC CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACGAAGGAA 

301 CTGATTATGT GTCTGTCGGG CAGCATTTTA AGGTCTGCCA CCCAGCAATA 

351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

401 ACATCAACCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 

451 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 

501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAC GTCATCCATA 

551 CGGTCGAAAT CCCGGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 CCAACGGCAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

701 CCGCCAAACC GCGCGTTACC CGCGTGCCGT ACGATGACAA GGCATACCGC 

751 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATTACC 

851 CCGCCGGCTC CGAAACACTT TAA 

This corresponds to the amino acid sequence <SEQ ID 480; ORF 127.a>: 

al27.pep 

1 MEIWNMLDTW LGAVPIRAEA VES VAWAAL LLARALLLNI HFKRHPDFGI 

51 ESKRRFLVAS RN ITLLLVLF SLAFIWSA QI QTLALSMFAV AAA\AA/AT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRH LENVQAEKLF ITPAAKPRVT RVPYDDKAYR 

25l' IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNYPAGSETL * 

ml27/al27 98.6% identity in 290 aa overlap 

10 20 30 40 50 60 

ml27 . pep MEIWNMLDTWLGAVPIRAEAVESVAAVAALLLARALLLNIHFKRHPDFGIESKRRFLVAS 
I I M I I I I M I I I I I I I I I I I I I [ I : i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I j I 
al27 MEIVJNMLDTWLGAVPIRAEAVESVAWAALLLARALLLNIHFKRHPDFGIESKRRFLVAS 

10 20 30 40 50 60 

70 80 90 100 110 120 

inl27 .pep RNITLLLVLFSLAFIWSAQIQTIALSMFAVAAAVWATKELIMCLSGSILRSATQQYSVG 
I I I I I I I i I I I I ] I I I I t I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al27 RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAWVATKELIMCLSGSILRSATQQYSVG 
70 80 90 100 110 120 
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130 140 150 160 170 180 

ml27 . pep DYIEINGLRGRWDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 
I I I I I M I I t I I I I I i I I I I I I I I I I I M I I I I I I I I I t I I I I I I I I I I [ I i i I I I I I I I 
al27 DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml27 . pep VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRXLENVQAEKLFITPAARPRVT 
I I I I I I I I I I I 1 I I i I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I : I I I t 
al27 VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRHLENVQAEKLFITPAAKPRVT 
190 200 210 220 230 240 

250 260 270 280 290 

ml27 . pep RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDEFLRVQYRLLNHPAGSETLX 
I I I I I I I I I I I I I I I I I M I j I I I I I I I I I I I I 1 I I i I I I I I : i i I I I t I I 
al27 RVPYDDKAYRIIVRFASPVSBCRLEIQQAVMDEFLRVQYRLLNYPAGSETLX 

250 260 270 280 290 



The following partial DNA sequence 

gl28 . seq 



1 


CI ^^CL ^ OVp»Ci 




















A nPfJTPTG A r 


201 






^ DX 




rV^AAPTGATn 






TfG A APTGT A 


351 


CGAATTTGCA 


ACGCTTTCCC 


401 


TGCGCGATTT 


CGTATTGAGC 


451 


GAACTGGCAA 


AACTGCAAAC 


501 


CCAAAACGTC 


CTAGACGCGA 


551 


CCGCACCGCT 


TGCCGGCATT 


601 


GCCGCGCAAA 


GCGAAGGCAA 


651 


GCACTACCTT 


GCCGTTATCC 


701 


AAATCTACCG 


CGCCTACGTT 


751 


AAATTCGACA 


ACACCGCCAA 


801 


AACCGccaaa 


CTGCTCGGCT 


851 


CCAAAATGGC 


GGACACGCCC 


901 


GCCCGCCGCG 


CCAAACCCTA 


951 


CTTCGCCCGC 


GAACACCTCG 


1001 


GCTACGCCGG 


CGAAAAACTG 


1051 


GAAGTCAAAA 


AATACTTCCC 


1101 


CCAAATCAAA 


AAACTCTACG 


1151 


TCTGGCACAA 


AGACGTGCGC 


1201 


ATCGGCGGCG 


TTTATATGGA 


1251 


CGCGTGGATG 


AACGACtaca 


1301 


TGCAACTGCC 


CACCGCCTAC 


1351 


GGCAAAGAAG 


CGCGTTTAAG 


1401 


AacCGGCCAC 


GGACTGCACC 


1451 


TGTCCGGCAT 


CAAcggcgtA 


1501 


TTTATGGAAA 


ACTTCGTTTG 


1551 


CCACGAAGAA 


AccgGCGAGC 


1601 


TcgcCGCCAA 


AAACTTCCAG 


1651 


TTCGCCCTCT 


TCGATATGAT 


1701 


GAAAAACTGG 


CAGCAGGTTT 


1751 


TCCAACCGCC 


CGAATACAAC 


1801 


GCcggcGGCT 


ATTCCGCAGG 


1851 


CAGCACCGAT 


GCCTACGCCG 


1901 


CAGGCAAACG 


CTTCTGGCAA 


1951 


gcgGCGGAAT 


CCTTCAAAGC 



; identified in N. gonorrhoeae <SEQ ID 48 1>: 

ccacttgggc gaagaaccCC GTTTTaatca 
AACCCGCCGT CCAAACCGCC ATCGCCGAAG 
GTCAAAGCGC AAACGCACAC CGGCTGGGCG 
CGGCATCACC GAACGCGTCG GCAGGATTTG 
ACTCCGTCGT GGACACGCCC GAACTGCGCG 
CCTGAAATCA CCGTCTTCTT CACCGAAATC 
CAACCGCTTC AAAACCATCA AAAATTCCCC 
CCGCACAAAA AACCAAGCTC GATCACGACC 
GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 
CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
CCGACGCGTT CGGCATTTAC TTTGACGATG 
CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
AACAGGTTAC AAAATCGGCT TGCAGATTCC 
AATACGCCGG CAACCGCGAA CTGCGCGAAC 
ACCCGTGCCA GCGAACTTTC AAACGACGGC 
CATCGACCGC ACGCTCGAAA ACGCATTGAA 
TTA/VAAATTA CGCCGAATTG TCGCTGGCAA 
GAACAGGTTT TAAACTTCCT GCACGACCTC 
CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 
CGCGAAGCCA AATACGCATT CAGCGAAACC 
CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 
GCATCGGATT CGCCGAAAAA ACCGTTCCCG 
TATTTTGAAT TGCAACAAAA CGGCAAAACC 
TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
AAGGCCGCCG CCGCTTTGCC GACGgcacGC 
CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 
CCACGACGAA ATCCTCACCC TCTTCCACGA 
ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
GAATGGGACG CGGTCGAACT GCCCAGCCAG 
GGAATACAAT GTATTGGCAC AAATGTCCGC 
CCCTGCCGAA AGAACTCTTC GACAAAATGC 
CGCGGTATGT TCCTCGTCCG GCAAATGGAG 
GATTTACAGT GAAAGCGACG AATGCCGTCT 
TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 
CGCTTCGCCA ACAGCTTCGG CCacatctTC 
CTATTACAGC TACGCATGGG CCGAAGTCCt 
CCTTTGAAGA AAGcGACGac gtcGCCGCCA 
GAAAtccttg ccgtcggcgg ctCCCGCAGC 
CTTCCGCGGA CGCGAACCGA GCATAGACGC 
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2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 482; ORF 128.ng>: 

gl28 .pep 

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA lAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 EliAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

3 51 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 
401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 
501 FMENFVWEYN VLAQMSAHEE . TGEPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR QSGFDNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 483>: 

ml28.seq (partial) 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTGCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCAC 

1 TACGCCAGCG AAAAACTGCG CGAAGCCAAA TACGCGTTCA GCGAAACCGA 

51 wGTCAAAAAA TAyTTCCCyG TCGGCAAwGT ATTAAACGGA CTGTTCGCCC 

101 AAmTCAAAAA ACTmTACGGC ATCGGATTTA CCGAAAAAAC yGTCCCCGTC 

151 TGGCACAAAG ACGTGCGCTA TTkTGAATTG CAACAAAACG GCGAAmCCAT 

201 AGGCGGCGTT TATATGGATT TGTACGCACG CGAAGGCAAA CGCGGCGGCG 

251 CGTGGATGAA CGACTACAAA GGCCGCCGCC GTTTTTCAGA CGGCACGCTG 

301 CAAyTGCCCA CCGCCTACCT CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG 

351 CAGGGAAGCC CGCyTGAGCC ACGACGAAAT CCTCATCCTC TTCCACGAAA 

401 CCGGACACGG GCTGCACCAC CTGCTTACCC AAGTGGACGA ACTGGGCGTA 

451 TCCGGCATCA ACGGCGTAkA ATGGGACGCG GTCGAACTGC CCAGCCAGTT 

501 TATGGAAAAT TTCGTTTGGG AATACAATGT CTTGGCACAA mTGTCAGCCC 

551 ACGAAGAAAC CGGcgTTCCC yTGCCGAAAG AACTCTTsGA CAAAwTGCTC 

601 GCCGCCAAAA ACTTCCAAsG CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 

651 CGCCCTCTTT GATATGATGA TTTACAGCGA AGACGACGAA GGCCGTCTGA 

701 AAAACTGGCA ACAGGTTTTA GACAGCGTGC GCAAAAAAGT CGCCGTCATC 

751 CAGCCGCCCG AATACAACCG CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 

801 AGGCGGCTAT TCCGCAGCTn ATTACAGCTA CGCGTGGGCG GAAGTATTGA 

851 GCGCGGACGC ATACGCCGCC TTTGAAGAAA GCGACGATGT CGCCGCCACA 

901 GGCAAACGCT TTTGGCAGGA AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 

951 nGCAGAATCC TTCAAAGCCT TCCGCGGCCG CGAACCGAGC ATAGACGCAC 

1001 TCTTGCGCCA CAGCGGTTTC GACAACGCGG TCTGA 

This corresponds to the amino acid sequence <SEQ ID 484; ORF 128>: 

ml28 . pep (partial ) 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA lAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH 



// 



1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRRPSDGTL 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKMWQQVL DSVRKKVAVI 

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 
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301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

ml28/gl28 

10 20 30 40 50 60 

MIDNALLHLGEEPRFNQIQTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 

I iiiiiiiiiiiihihiiiiiihiiiiiiii iiihiiiiiiiiiMii mil 

MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPIiTGIT 
10 20 30 40 50 60 

70 80 90 100 110 120 

ERVGRIWGWSHLNSWDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

llllllllllllll hllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

ERVGRIWGWSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 



gl28 .pep 
inl28 

gl28 .pep 
ml28 



130 140 150 160 170 180 

g 12 8 . pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGI Y 

lllllllllhl 

ml 2 8 TLSPAQKTKLNH 
130 

// 

340 350 360 

g 1 2 8 . pep YAGEKLREAKYAFSETEVKKY FPVGKVLAG 

IMIIIIIIIIIIII Mil 1 1 II II I 

ml2 8 YASEKLREAKYAFSETXVKKYFPVGXVLNG 

10 20 30 



370 380 390 400 410 420 

gl28 .pep LFAQIKKLYGIGFAEKTVPVWHKDVRYFELOQNGKTIGGVYMDLYAREGKRGGAWMNDYK 

Mil ll.lllllhlllllllllllll II I II h:|l II I II II I II II I II II I II II 
ml28 LFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWMNDYK 
40 50 60 70 80 90 



430 440 450 460 470 480 

gl28.pep GRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVDELGV 

llllhllllllllllllllllllllhllllllllll lilllllllllllllllllM 

ml28 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVDELGV 
100 110 120 130 140 150 



490 500 510 520 530 540 

gl28.pep SGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGEPLPKELFDKMLAAKNFQRGMF 

mill iimiiiiiiiiiiiiiiiiii iiiiiii mm ii iiiiiii m 

ml 2 8 SGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKEIiXDKXliAAKNFQXGMF 
160 170 180 190 200 210 



550 560 570 580 590 600 

gl2 8 . pep LVRQMEFALFDMMI YSESDECRLKNWQQVLDSVRKEVAVIQPPEYNRFANSFGHI FAGGY 

III iiimiiiiihii iiimiiimihiiiiiiiimii iiiimiii 

ml28 XVRQXEFALFDMMIYSEDDEGRLKNWQOVLDSVRKKVAVIQPPEYNRFALSFGHIFAGGY 
220 230 240 250 260 270 



610 620 630 640 650 660 

gl28 . pep SAGYYSYAWAEVLSTDAYAAFEESDDVAATGKRFWQE I LAVGGSRSAAES FKAFRGREPS 

ih iiiiiiiiihiiiiiiiiiiiiiiiiiiiiiimii iihiiiiiiiiiiiii 

ml28 SAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRGREPS 
280 290 300 310 320 330 
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IDALLRQSGFDNAAX 

Illllhlllllh 
IDALLRHSGFDNAVX 
340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 485>: 

al28.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

451 GAATTGGCAA AACTGCAAAC CGT^GGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCA7VACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAA/^ ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AMCTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 486; ORF 128.a>: 

al28.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA lAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 



gl28.pep . 
ml28 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml 28/al 28 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml28 . pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
al28 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml28.pep ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
I I I I I I I I I I I I I I I : I I I I M I : I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I M I 
al28 ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 

ml28 . pep TLSPAQKTKLNH 

III I I I I M I I 

al28 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 



ml28.pep 

al28 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 
190 200 210 220 230 240 



inl28.pep 

a 1 2 8 TRASELS DDGKFDNT AN I DRTLENALQTAKLLGFKN YAELS LATKbdADT PEQVLN FLHDL 

250 260 270 280 290 300 

140 150 

ml28 . pep YASEKLREAKYAFSETXVKKYFPVGX 

11:1111111111111 lllillll 
al28 ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

160 170 180 190 200 210 

ml28 . pep VLNGLFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 
IIIMIII II I I I M II I I I I i I I 1 II I I I llllllhllllllllllllllllllll 
al28 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 

220 230 240 250 260 270 

inl28 . pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
llllllllllllllllllllllllhllllhilllllllll I M I I I I I I II t I I 11 I 
al28 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
430 440 450 460 470 480 

280 290 300 310 320 330 

ml28 . pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
llllllllll I I I I II I I I I I I I I I I II II II I llllllllllllll II IIIIMI 
al28 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

340 350 360 370 380 390 

ml28 . pep XGMFXVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 

Ml III Mllllllllll Illlltllllll|:ll|::|||l|||| |||||| 

al28 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAVVRPPEYNRFANSFGHIF 
550 560 570 580 590 600 
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400 410 420 430 440 450 

ml28 . pep AGGYSAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRG 
null: I II I II II I I I I I I I I I I I II M I I I j] I I II I t I I I I 111:111111111 
al28 AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
610 620 630 640 650 660 

460 470 
ml28 .pep REPSIDALLRHSGFDNAVX 

II I I II I II I I II I II I : 
al28 REPSI D ALLRHS G FDNAAX 

670 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 487>: 

gl28-l.seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCXSTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CX3AACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CX5AATTTGCA ACGCTTTCCC CCX3CACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACX3GCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCAirTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT T6CAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGCCAAA CTGCTCX3GCT TTAAAAATTA CGCCGAATTG TOGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAtSAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

1451 TGTCCGGCAT CAACGGCGTA AAA 

This corresponds to the amino acid sequence <SEQ ID 488; ORF 128-1. ng>: 

gl28-l.pep (partial) 

1 MIDNALIiHLG EEPRFNQIKT EDIKPAVQTA lAEARGQIAA VKAQTHTGWA 

51 NTVERIiTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 489>: 

ml28-1.8eg 

1 ATGACraACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 
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201 






251 






301 


GGACAAGACA 


TCGAGCTGTA 


351 




n\^\r\^ X w X v^wk^ 


401 


TCPG fR ATTT 


rGTCTTCAGf 
L>U X 1 UMuW 


451 


X \j\s\^nt\ 


AA(TrGC!AAAC? 


501 


nnnnv^u X ^ 


CTAGAGGf^A 


551 


CCGCACCGCT 


TG CCGGCATT 


601 










J. X L. L. 


701 




fV n a rvrT" 

^uL.^.lAW\2j. 1 




MAM 1 Jl (vVtML-M 


anacrTTT^aa 

Mv. MU CL.MM 


801 


a a r'/^r'r'na a a 

MML. V. Vj U L.MAM 




OD J. 


r'p a a a a Tf n r* 


fir* a r* a rvrTT' 


901 




fv a a a f^f^r*T a 
L\.AAAL.UL# i A 


?3 J. 




naaarT'C'Tva 


1001 




rv^aa a a ar^/^ 


1051 


r* a a p T/^ a a a a 


AATACTTCCC 


1101 




aa a/^n^^^ft 
ArtA(-Tl-TACCa 


1151 


TT'TY^ r* a r* a a 
1 L. AVaVjCAL-ftM 


a ^ a TY^ rvr* 
A(jAL.u 1 uLasL. 


1201 


A 1 AkjLsCoLa^Vj 


TTTATATGGA 




L.VjU\9 IvjijA 1 \3 


a a ^r* a r'Ta oa 


1301 


J.\3UAA\. i\3(^L. 


LAk^ULi^^i. Av, 






rv^ r*r! rT" Tr* a r* 


1401 






1451 


TATCCGGCAT 


CAACGGCGTA 


1501 


TTTATGGAAA 


ATTTCGTTTG 


1551 


CCACGAAGAA 


ACCGGCGTTC 


1601 


TCGCCGCCAA 


AAACTTCCAA 


1651 


TTCGCCCTCT 


TTGATATGAT 


1701 


GAAAAACTGG 


CAACAGGTTT 


1751 


TCCAGCCGCC 


CXJAATACAAC 


1801 


GCAGGCGGCT 


ATTCCGCAGG 


1851 


GAGCGCGGAC 


GCATACGCCG 


1901 


CAGGCAAACG 


CTTTTGGCAG 


1951 


GCGGCAGAAT 


CCTTCAAAGC 


2001 


ACTCTTGCGC 


CACAGCGGTT 



ACTCCGTCGC CGACACGCCC GAACTGCGCG 
CCCGAAATCA CCGTCTTCTT CACCGAAATC 
CAACCGCTTC AAAACCATCA AAAATTCCCC 
CCGCACAAAA AACCAAACTC AACCACGATC 
GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 
CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
CCGACGCGTT CGGCATTTAC TTTGACGATG 
CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
AACAGGCTAC AAAATCGGCT TGCAGATTCC 
AATACGCCGA CAACCGCGAA CTGCGCGAAC 
ACCCGCXSCCA GCGAACTTTC AGACGACGGC 
CATCGACCGC ACGCTCGCAA ACGCCCTGCA 
TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
GAACAAGTTT TAAACTTCCT GCACGACCTC 
CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
ACCTCGCCGA TTTGCAACCG TGGGACTTGG 
CGCGAAGCCA AATACX3CGTT CAGCGAAACC 
CGTCGGCAAA GTATTAAACG GACTGTTCGC 
GCATCGGATT TACCGAAAAA ACCGTCCCCG 
TATTTTGAAT TGCAACAAAA CGGCGAAACC 
TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
AAGGCCGCCG CCGTTTTTCA GACGGCACGC 
CTCGTCTGCA ACTTCGCCCC ACCCX5TCGGC 
CCACGACGAA ATCCTCATCC TCTTCCACGA 
ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
GAATGGGACG CGGTCGAACT GCCCAGCCAG 
GGAATACAAT GTCTTGGCAC AAATGTCAGC 
CCCTGCCGAA AGAACTCTTC GACAAAATGC 
CGCGGCATGT TCCTCGTCCG GCAAATGGAG 
GATTTACAGC GAAGACX3ACG AAGGCCGTCT 
TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 
CGCTTCGCCT TGAGCTTCGG CCACATCTTC 
CTATTACAGC TACGCGTGGG CGGAAGTATT 
CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
CTTCCGCGGC CGCGAACCGA GCATAGACGC 
TCGACAACGC GGTCTGA 



This corresponds to the amino acid sequence <SEQ ID 490; ORF 128-1>: 

ml28-l .pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA lAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDFVLS GAELPPEQQA 

151 EliAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG. 

251 KFDNTANIDR TLANALQTAK LLGFKNYAEL SLATKMADTP EQVIiNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLNLADIiQP WDLGYASEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQMGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 

451 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAV* 



na28-l/gl28-l 94.5% identity in 491 aa overlap 



10 20 30 40 50 60 

gl 2 8 - 1 . pep MIDNALLHLGEEPRFNQIKTEDIKPAVQTAI AEARGQIAAVKAQTHTGWANTVERLTGIT 

I llllllllllllhlllllllllhllllllll lllhlllllllllllll Mill 

ml 2 8 - 1 MTDNALLHLGEEPRFDQIKTEDI KPALQTAIAEAREQIAAI KAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 



70 80 90 100 110 120 

gl2 8 - 1 . pep ERVGRIWGWSHLNSWDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

lllllllllllllllhlllllllllllllllillllllllllllllllllllllllll 
ml28 -1 ERVGRIWGWSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 
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130 140 150 160 170 180 

gl2 8 - 1 . pep tlspaqktkix>hdlrdft;lsgaelpperqaelaku3tegaqlsakfsqnvldatdafgi Y 

lllllllll|:|||llllllllllllhllllllllllllllllllllllllllllllll 
ml2 8 - 1 TLSPAQKTKLNHDIiRDFVLSGAELPPEQQAEIAKLQTEGAQI^AKFSQNVLDAT^^ 

130 140 150 160 170 180 



190 200 210 220 230 240 

gl2 8 - 1 . pep FDDAAPLAGIPEDALAMFAAAAQSEGFCTGYKIGLQIPHYLAVIQYAGNRELREQIYRAYV 

llllllllllllllllllllllllhllllllllllllllllilll lllllllllllll 

ral2 8 -1 FDDAAPIAGIPEDAIiT^FAAAAQSESKTGYKIGLQI PHYLAVIQYADNRELREQIYRAYV 

190 200 210 220 230 240 



250 260 270 280 290 300 

gl2 8 - 1 . pep TRASELSNIX3KFDNTANIDRTLENALKTAKLLGFKNYAELSLATKKADTPEQVLNFLHDL 

llllllhllllllllllllll llhlllllllllllllilllllllllllllllllll 

ml 2 8 - 1 TRASELSDIXSKFDNTANIDRTIJVNALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 



310 320 330 340 350 360 

gl 2 8 - 1 . pep ARRAKPYAEKDLAEVKAFAREHLGLADPQPWDLSYAGEKLREAKYAFSETEVKKYFPVGK 

lllllllllllllllllllll llllhlhlllllllllllllllllllllll 
ml 2 8 - 1 ARJUVKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPV^ 
310 320 330 340 350 360 



370 380 390 400 410 420 

gl 2 8 - 1 . pep VIAGLFAQIKKLYG IGFAEKTVPVWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWM 

II lllllllllllllhlllllllMllllllllllhlllllllllllllMIIIIII 

ml28-l VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 



430 440 450 460 470 480 

gl2 8 - 1 . pep NDYKGRRRFADGTLQLPTAYLVCNPAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

illlllllhllllllllllllllllllllhllllllllll lllllllllllllllll 

ml 2 8 - 1 NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGRBARLSHDEI LI LFHETGHGLHHLIiTQVD 

430 440 450 460 470 480 



490 

gl2 8 - 1 . pep ELGVSGINGVK 
lllllllllh 

ml2 8 - 1 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 491>: 

al28-1.8eq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCC6AAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCX3CCCGA ACAGCAGGCA 

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 T^CCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCX3TCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCX3AAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACX3CGGCGG 
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1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

1451 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 492; ORF 128-1. a>: 

al28-l.pQp 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA lAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 EIAKLQTEGA QLSAKFSQNV LDATDAFGIY PDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLABVKAPAR ESLGIiADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

10128 -l/al28-l 97.8% identity in 677 aa overlap 

10 20 30 40 50 60 

MTDNALLHLGEEPRFDQIKTEDIECPALQTAIAEAREQIAAIKAQTHTCWANTVEPLTGIT 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

MTDNALIiHLGEEPRFDQIKTEDIICPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 

70 80 90 100 110 120 

ERVGRIWGWSHLNSVTDTPELRAAYNELMPE ITVFFTE IGQDI ELYNRFKTI KNSPEFD 

IIIIIIIIIIIIMIhllllllhlllllllllllllllllllllllllllllllllll 

ERVGRIWGWSHLNSVADTPELRAVYNELMPE ITVFFTE IGQDIELYNRFKTI KNSPEFD 
70 80 90 100 110 120 

130 140 150 160 170 180 

TLSHAQKTKIjNHDUUJFVLSGAELPPEQQAEIAKLQTEGAQLSAKFSQmn^ATDAFGIY 

III llllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 

190 200 210 220 230 240 

FDDAAPLiAGI PEDALAMFAAAAQSBGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 

llllllllllllllllllllllllhllllllllllllllllllllllhllllllllll 
FDDAAPLAGIPEDALAMFAAAAQSESKTGYKIGLQIPHYLAVIQYADNRELREQIYRAYV 
190 200 210 220 230 240 

250 260 270 280 290 300 

TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

llllllllllllllliiilill llllllllllllllllllllilliiiiiiiiiiiiii 

TRASELSDDGKFDNTANIDRTUVNALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 



al28-l .pep 
ml28-l 

al28-l -pep 
ml28-l 

al28-l.pep 
ml28-l 

al28-l .pep 
ml28-l 

al28-l .pep 
ml28-l 



al28-l.pep 



310 320 330 340 350 360 

ARRAKPYAEKDIAEVKAFARESLGLADl^PWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
llllllllllllllllllllllhlllllllllilhlllllllllllllllllllllll 
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ml2 8 - 1 ARRAKPYAEKDIAEVKAFARESIiNUUDLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

370 380 390 400 410 420 

al 2 8 - 1 . pep VLNGLFAQIKIOiYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLyAREGKRGGAWM 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
ml 2 8 - 1 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFEl^QQNGETIGGVYriDLYAREGKRGGAWM 
370 380 390 400 410 420 

430 440 450 460 470 480 

al 2 8 - 1 . pep NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

IIIIIIIIIIIIIIIIIIIMIIIhllllhllllllllll lllllllllllllllll 

Tnl2 8 - 1 NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
430 440 450 460 470 480 

490 500 510 520 530 540 

al28-l .pep EIX3VSGINGVEWDAVELPSQFMENFWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
ml 2 8 - 1 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

550 560 570 580 590 600 

al 2 8 - 1 . pep RGMFLVRQMEFALFDMMI YSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSPGHI F 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihiihHiiiiiii mill 

ml 2 8 - 1 RGMFLVRQMEFALFDMMI YSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHI F 

550 560 570 580 590 600 

610 620 630 640 650 660 

al 2 8 - 1 . pep AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 

tlllllllllllllllllllllllllllllllllllllllllllllllllllllllllll' 
ml28-l AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
610 620 630 640 650 660 

670 679 
al 2 8 - 1 . pep REPSIDALLRHSGFDNAAX 

llllllllllllllllh 

ml 2 8 - 1 REPSIDALLRHSGFDNAVX 
670 

al2B-l/ P44573 

sp| P44573|0PDA_HAEIN OLIGOPEPTIDASE A >gi 1 1075082 | pir | | C64055 oligopeptidase A (prlC) homolog 
- Haemophilus influenzae (strain Rd KW20) 

>gi 1 1573174 [U32706) oligopeptidase A (prlC) [Haemophilus influenzae Rd] Length = 681 
Score = 591 bits (1507), Expect = e-168 

Identities « 309/677 (45%), Positives = 415/677 (60%), Gaps = 4/677 (0%) 

NALLHLGEEPRFDQIKTEDIKPALQTXXXXXXXXXXXXXXXTHTGWANTVEPLTGITERV 63 
N LL++ P F QIK E I+PA++ H W N + PLT +R+ 

NPLLNIQGLPPFSQIKPEHIRPAVEKLIQDCRNTIEQVLKQPHFTWENFILPLTETNDRL 64 

GRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFDTLS 123 

R W VSHLNSV ++ ELR AY +p ++ + T +GQ LYN + +KNS EF S 
NRAWSPVSHLNSVKNSTELREAYQTCLPLLSEYSTWVGQHKGLYNAYLALKNSAEFADYS 12 4 

HAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGI YFDD 183 
AQK + + LRDF LSG L E+Q ++ ++L-H-+FS NVLDAT + ++ 



A LAG+PE AL +A+S+G GY+ L+IP YL V+ Y +NR LRE++YRAY TRA 



SE + GK+DN+ ++ L ++ AKLLGF Y ELSLATKMA+ P+QVL+FL LA 



RAKP EK+L E+K + + G+ +L PWD+G+ EK ++ YA ++ E++ YFP +V+ 



Query: 


4 


Sb j ct : 


5 


Query: 


64 


Sb j ct : 


65 


Query: 


124 


Sbjct : 


125 


Query: 


184 


Sbjct: 


185 


Query: 


244 


Sbjct: 


245 


Query: 


303 


Sb j ct : 


305 
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Query: 363 NGLFAQIKKLYGIGFTE-KTVPVWHKDVRYFEL-QQNGETIGGVYMDLYAREGKRGGAWM 420 

+GLF IK+++I EKV WHKDVR+F+L +N + G Y+DLYARE KRGGAWM 
Sbjct: 365 SGLFELIKRIFNIRAVERKGVOTWHKDVRFFDLIDENDQLRGSFYLDLYAREHKRGGAWM 424 

Query: 421 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEIXXXXXXXXXXXXXXXXQVD 480 

+D GR+R DG+++ P AYL CNF P+G K A +H+E+ Q+D 
Sbjct: 425 DDCIGRKRKLIXSSIETPVAYLTCMFNAPIGNKPALFraNEVTTLFHEFGHGIHHMLrQID 4 84 

Query: 481 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 540 

V+GINGV WDAVELPSQFMEN+ WE LA +S H ETG PLPKE ++L AKNFQ 
Sbjct: 4 85 VSDVAGINGVPWDAVELPSQFMENWCWEEEAIAFISGHYETGEPLPKEKLTQLLKAKNFQ 544 

Query: 541 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 600 

MF++RQ+EF +FD ++ D + L SV+ +VAV++ ++ R +SF HIF 

Sbjct: 54 5 AAMFILRQLEFGIFDFRIiHHTFDAEKTNQILDTLKSVKSQVAVIKGVDWARAPHSFSHIF 604 

Query: 601 XXXXXXXXXXXXWAEVLSADAYAAFEESDDV-AATGKRFWQEILAVGGSRSAAESFKAFR 659 

WAEVIiSADAY+ FEE TGK F EIL GGS E FK FR 

Sbjct: 605 AGGYAAGYYSYLWAEVLSADAYSRFEEEGIFNPITGKSFLDEILTRGGSEEPMELFKRFR 664 

Query: 660 GREPSIDALLRHSGFDN 676 

GREP +DALLRH G N 
Sbjct: 665 GREPQLDALLRHKGIMN 681 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 493>: 

gl29.seq 

1 ATGCTTTCAC CTCCTCGGCG TAAAACGGCG GCACATCAAT CAAGCCGTCT 

51 TTCATTTGCG TGCGGAAAAA ATGCGGCGTG TTGCCGTGAT CAAAATCAAT 

101 ATCGTGCAGC ATCCAGCCCA AATCGCGGTT TGCCTCGCTT TCCGATAACG 

151 CCGACGGCGG CAGCGGTTCA CCCTTATCCG CGCTTTCGCC ATTTGCCCTT 

201 TCAGGCTGCG GGCATAGGGG CGGAACAGGC GGCGGTCGAA TCCTGTTTCA 

251 TCCGGACAAA CGCGTTGGCA GTCGGAAAAT CCGGCCGGCC GTGTCAAATA 

301 ATGCGTTACT TTGGCCGGGT CTTGTCCTTT GTAAGCGGCG GTCTTTTTTT 

351 GCGCGCCATC CGCATCTGTT TGGGCGCATG GCAAACGGCG GCTGCCGTAC 

401 AATCAAAATG TTTGGCGATT TCATGCAGAC AGGCATCCGG ATGCCGCCCG 

451 ACATATCGAG CCGGTTTTTG CCTATCCGAT TTGGCGGCAT TTAGGCCGGT 

501 AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 494; ORF 129,ng>: 

gl29.pep 

1 MLSPPRRKTA AHQSSRLSFA CGKNAACCRD QNQYRAASSP NRGLPRFPIT 

51 PTAAAVHPYP RFRHLPFQAA GIGAEQAAVE SCFIRTNALA VGKSGRPCQI 

101 MRYFG RVL5F VSGGLFLRAI RIC LGAWQTA AAVQSKCLAI SCRQASGCRP 

151 TYRAGFCLSD LAAFRPVT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 495>: 

Tnl29.seq (partial) 

1 ..TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCGGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 

251 GCAGATAGGC ATCCGGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 496; ORF 129>: 

Tnl2 9.pep (partial) 

1 . . YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGRLC QIMRYFGRVL 

51 FFVSGGLFLR VIPIC LSAXQ MVAAVQSKCL AISCRXASGC CPTYXAGFCL 

101 SDLTAFRPVT * 

Computer analysis of this amino acid sequence gave the following results: 



wo 99/57280 PCT/US99/09346 

374 



Homology with a predicted ORF from N, gonorrhoeae 

ORF 129 shows 79.1% identity over a 1 10 aa overlap with a predicted ORF (ORF 129.ng) 
from M gonorrhoeae: 

ml29/gl29 

10 20 30 

ml29 .pep YLRFHYLPFQAAGIGTEQVAVKSCFIQINT 

I lh:|lllllllhl|:||:||||: h 
gl2 9 RDQNQYRAASSPNRGLPRFPITPTAAAVHPYPRFRHLPFQAAGIGAEQAAVESCFIRTNA 
30 40 50 60 70 80 

40 50 60 70 80 90 

ml29 . pep LWGKFGRLCQir4RYFGRVLFFVSGGLFLRVIPICLSAXQMVAAVQSKCLiAISCRXASGC 

hill II IIIIIIIMII IIIMIIIhl I I hi I :| I II II I II I III INI 
g 1 2 9 LAVGKSGRPCQI MRYFGRVLS FVSGGLFLRAI RI CLGAWQTAAAVQS KCLAI S CRQASGC 

90 100 110 120 130 140 

100 110 
ml29 .pep CPTYXAGFCLSDLTAFRPVTX 

III lllllllhlllllll 
gl29 RPTYRAGFCLSDLAAFRPVTX 
150 160 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 497>: 

al29.8eq (partial) 

1 TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCAGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AT^TGTTTG GCGATTTCAT 

251 GCAGATAGGC ATCCTGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 498; ORF 129.a>: 

al29.pGp (partial) 

1 YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGQLC QIMRYFGRVL 

51 FFVSGGLFLR VIPIC LSA*Q MVAAVQSKCL AISCR*ASWC CPTY*AGFCL 

101 SDLTAFRPVT * 

inl29/al29 98.2% identity in 1 10 aa overlap 

10 20 30 40 50 60 

nil29.pep YLRFHYLPFQAAG I GTEQVAVKSCFIQINTLWGKFGRLCQIMRYFGRVL FFVSGGLFLR 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I M I I I I 
al29 YLRFHYLPFQAAGIGTEQVAVKSCFIQINTLVVGKFGQLCQIMRYFGRVLFFVSGGLFLR 

10 20 30 40 50 60 

70 80 90 100 110 

ml29 . pep VIPICLSAXQMVAAVQSKCLAISCRXASGCCPTYXAGFCLSDLTAFRPVTX 
I I I M I I I I I I I I I I I I I I I I I I I I I I I i I I I 1 I I I I I I I I I I I I I I I 1 I 
al29 VIPICLSAXQMVAAVQSKCLAISCRXASWCCPTYXAGFCLSDLTAFRPVTX 

70 80 90 100 110 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 499>: 

gl30 . seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG TGGATCGTTC GGCGATGTCG ATGCCACTAC GGAAGCGGCA 

151 ACGCAGACCC GCATCCAGCC TGTCGGACAA TTGACGATGG GTGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 
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251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

301 AACGGCGACT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

351 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGCAG 

401 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACCTACAT GGCGAATAAA 

4 51 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 

501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 

551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 

601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 

651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 

701 AAGGCAAAGA AACCTTGCAC AAACATGCCC TTGAAGGCTT TAACGCGATG 

751 CCGGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 

801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 



This corresponds to the amino acid sequence <SEQ ED 500; ORF 130.ng>: 

gl30.pep 

1 MKQLRDNKAQ GSALF TLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 



51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAADLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 501>: 

inl30.seq (partial) 



1 . .GGCGAACAGA TTTTCGGCAA AATCTGTATC CAATGCCACG CGGCGGACAG 

51 CAATGTGCCG AACGCTCCGA AACTGGAACA CAACGGCGAT TrGGCACCGC 

101 GTATCGgCAA GGCTTCGATA CCTTGTTCCA ACACGCGCTG AACGGCTTTA 

151 ACGCCATGCC TGCAAAAGGC GGTGCGGCAG ACCTGACCGA TCAGGAACTT 

201 AAACGGGCGA TTACTTACAT GGCGAACAAA AGCGGCGGTT CTTTCCCGAA 

251 TCCTGATGAG GCTGCGCCTG CCGACAATGC CGCTTCAGGA ACAGCTTCTG 

301 CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG CGAAGGCAGA AGACAAGGGT 

351 GCGGCACCCC TGCGGTCGGC GTTGACGGTA AAAAAGTCTT CGAAGCAACC 

4 01 TGTCAGGTGT GCCACGGCGG TTCGATTCCC GGTATTCCCG GCATAGGCAA 

451 AAAAGACGAT TGGGCACCGC GTATCAAAAA AGGCAAAGAA ACCTTGCACA 

501 /WVCACGCCCT TGAAGGCTTT AACGCGATGC CTGCCAAArG CGgCAATGCA 

551 GGTTTGAGCG ATGACGAAgT CAAAGCGGCT GTTGACTATA TGGCAAACCA 

601 ATCCGGTGCA AAATTCTAA 



This corresponds to the amino acid sequence <SEQ ID 502; ORF 130>: 

ml3 0 . pep ( part ial ) 

1 ..GEQIFGKICI QCHAADSNVP NAPKLEHNGD XAPRIQGFDT LFQHALNGFN 



51 AMPAKGGAAD LTDQELKRAI TYMANKSGGS FPNPDEAAPA DNAASGTASA 

101 PADSAAPAEA KAEDKGAAPA VGVDGKKVFE ATCQVCHGGS IPGIPGIGKK 

151 DDWAPRIKKG KETLHKHALE GFNAMPAKXG NAGLSDDEVK AAVDYMANQS 

201 GAKF* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 130 shows 98.1% identity over a 206 aa overlap with a predicted ORF (ORF 130.ng) 
from A^. gonorrhoeae: 

ml30/gl30 



ml30 .pep 



10 20 30 

GEQ I FGKI CI QCHAADSNVPNAPKLEHNGD 



gl30 




50 60 70 80 90 100 



ml30 .pep 



40 50 60 70 80 89 

XAPRI-QGFDTLFQHAIiNGFNAMPAKGGAADLTDQELKRAITYMANKSGGSFPNPDEAAP 



gl30 




110 120 130 140 150 160 
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90 100 110 120 130 140 

ADNAASGTASAPADSAAPAEAKAEDKGAA-PAVGVDGKKVFEATCQVCHGGSIPGIPGIG 

Illllllllllllllllllllllllllll llllllllllllllllllllllllllllll 
ADNAASGTASAPADSAAPAEAKAEDKGAAAPAVGVDGKKVFEATCQVCHGGSIPGIPGIG 
170 180 190 200 210 220 

150 160 170 180 190 200 

KKDDWAPRIKKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 

llllllllllllllllllllllllllllll llllllllllllllllllllllllll 
KJODWAPRIKKGKETLHKHALEGFNAMPAKGGNAGLSDDEVKAAVDYMANQSGAKFX 
230 240 250 260 270 280 

The following partial DNA sequence w^s identified in N. meningitidis <SEQ ID 503>: 

al30 , seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG CGGCTCGTTC GGCGATGTCG ATGCCACTAC GGAAGCAGCA 

151 ACGCAGACCC GTATCCAGCC TGTCGGACAA TTGACGATGG GCGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 

251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

301 AACGGCGATT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

351 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGTAG 

4 01 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACTTACAT GGCGAACAAA 

4 51 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 

501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 

551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 

601 AAAT^GTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 

651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 

7 01 AAGGCAAAGA AACCTTGCAC AAACACGCCC TTGAAGGCTT TAACGCGATG 

751 CCTGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 

801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 504; ORF 130.a>: 

al30.pep 

1 MKQLRDNKAQ GSAL FTLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAVDLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 

ml30/al30 97.6% identity in 206 aa overlap 

10 20 30 

GEQI FGKICIQCHAADSNVPNAPKLEHNGD 
llllllllilllllllllllllllllllll 
DATTEAATQTRIQPVGQLTMGDGIPVGERQGEQIFGKICIQCHAADSNVPNAPKLEHNGD 
50 60 70 80 90 100 

40 50 60 70 80 89 

XAPRI-QGFDTLFQHALNGFNAMPAKGGAADLTDQELKRAITYMANKSGGSFPNPDEAAP 
I I I I M I I I I I ) I I I I I I I I M I I I I I : I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I 
WAPRIAQGFDTLFQHALNGFNAMPAKGGAVDLTDQELKRAITYMANKSGGSFPNPDEAAP 
110 120 130 140 150 160 

90 100 110 120 130 140 

ADNAASGTASAPADSAAPAEAKAEDKGAA- PAVGVDGKKVFEATCQVCHGGSI PGI PGIG 
I I I I I I t I I I I I I I I I I I 1 I 1 I I I I I I I t I M I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 
ADNAASGTASAPADSAAPAEAKAEDKGAAAPAVGVDGKKVFEATCQVCHGGSIPGIPGIG 
170 180 190 200 210 220 

150 160 170 180 190 200 

KKDDWAPRIKKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 



ml30 .pep 
gl30 

ml30 .pep 
gi30 



ml30 .pep 
al30 

ml30 .pep 
al30 

ml 3 0 . pep 
al30 

ml 30 .pep 
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I I I I I t I I I I I I I I I I I I I I I 1 1 I I I I I i I I I I 1 1 I I I 1 1 I It I I i I I I I I I I i I i 

a 1 3 0 KK DDWAPR I KKGKET LHKHALEG FNAMPAKGGN AGLS DDEVKAAVDYMANQSGAKFX 

230 240 250 260 270 280 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 505>: 

gl32.seq . 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT ATTAATATTA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGC7VAAGGC GCGGATGCCG 

101 GCGCGACCTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CCAACTTcct CAgccGCTCG AccGccGTTG CAGCAACAtt 

201 tttcttTGca acctgcAtgg gctatggTgt atattcacac CCACACGACA 

251 AAACACGGTT TGGACTtcagf caacataCGA CAGACTCAGC AagcACCCAA 

3 01 ACCcgtAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACagtTTTT CAAATgccga caTGgtga 

This corresponds to the amino acid sequence <SEQ ID 506; ORF 132.ng>: 

gl32 .pep 

1 MEAFKTLIWI INIISALAV I VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QHTTDSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 507>: 

ml32.seq (partial) 

1 ATGGAAGCCT TCAAAACCTT AATTTGGATT GTTAATTTAA TTTCCGCTTT 

51 GGCCGTCTTC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGA. . . 

This corresponds to the amino acid sequence <SEQ ID 508; ORF 132>: 

ml32.pep (partial) 

1 MEPFKTLIWI VNLISALAV F VLVLIjQHGKG ADAGATFG. . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF fi:om A^. gonorrhoeae 

ORF 132 shows 89.5% identity over a 38 aa overlap with a predicted ORF (ORF 132.ng) 
from N. gonorrhoeae: 

ml32/gl32 



10 20 30 

ml3 2 . pep MEPFKTLIWI VNLISALAVFVLVLLQHGKGADAGATFG 

II llllllhhIMIIhllllllllllllllllll 

gl32 MEAFKTLIWI INIISALAVIVLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS 

10 20 30 40 50 60 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 509>: 

al32.seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT GTTAATATAA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CTAACTTCCT CAGCCGCTCG ACCGCCGTTG CAGCAACATT 

201 TTTCTTTGCA ACCTGCATGg GCTATGGTGT ATATTCACAC CCACACGACA 

251 AAACACGGTT TGGACTTCAG CAACGTACAA CAAACTCAGC AAGCACCCAA 

301 ACCCGTAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACAGTTTTT CAAATGCCGA CATGGTGA 

This corresponds to the amino acid sequence <SEQ ID 510; ORF 132.a>: 

al32 .pep 

1 MEAFKTLIWI VNIISALA VI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QRTTNSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 



ml32/al32 92.1% identity in 38 aa overlap 
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10 20 30 

ml32 .pep MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG 
II lllllllll:|NIII:IIIIIIIIMIIIIIIII 
al32 MEAFKTLIWIVNIISALAVIVLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS 

10 20 30 40 50 60 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 511>: 

gl34 .seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCCGATGCGG GTAAAACCAC GCTGACCGAA AAACTGCTGC 

101 TGTTTTCGGG CGCGATTCAA AGCGCAGGCA CG6TGAAAGG TAAGAAAACC 

151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC AGCGCGGCAT 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCX3A CTACAAAGAC CACACCGTCA 

251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

301 GTTTTAACCG CAGTGGACAG CGCCTTGATG GTCATCGACG CGGCAAAAGG 

351 CGTGGAAGCG CAAACCATCA AACTCTTGAA CGTCTGCCGC CTGCGCGATA 

401 CGCCGATTGT TACCTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCT 

451 TTGGAACTCT TGGACGAAGT GGAAGACATC CTGCAAATCC GCTGCGCGCC 

501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 

551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCCTGCCG 

601 CACGAGTTCG ACATCATCAA AGGCATAAAC AATCCCGAAT TGGAACAACG 

651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 

701 CGGCTTCCAA CGAATTTAAT CTCGacgaAT TTCTCGCcgG CGAACTCACG 

751 CCAGTGTTCT TCGGCTCTGC GATTAACAAC TTCGGCATTC AGGAAATCCT 

801 CAATTCATTG ATTGACTGGG CACCCGCACC GAAACCGCGC GACGCGACCA 

851 TGCGCATGGT CGGGCCGGAC GAGCCGAAAT TTTCCGGATT TATCTTTAAA 

901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATCG CCTTCTTGCG 

951 CGTCTGCTCC GGTAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTAG TAACCTTCAT GTCGCACGAC 

1051 CGCGAACTGG CGGAAGAAGC CTACGCCGGC GACATCATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTTACCGG CATCCCATTC TTCGCGCCCG AACTGTTCCG CAGCGTCCGC 

1201 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGTT TGCAACAACT 

1251 CGGCGAAGAA GGTGCGGTTC AAGTATTCAA ACCGATGAGC GGCGCGGATT 

1301 TGATTTTGGG TGCGGTCGGC GTGTTGCAGT TTGAAGTCGT AACCTCACGC 

1351 CTCGCCAACG AATACGGCGT GGAAGCCGTG TTCGACAGCG CATCCATCTG 

1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCGGAATTTG 

1451 AAAAAGCCAA CGCAGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 

1501 TACCTCGCCC CCAACCGCGT GAATTTGGGG TTGACGCAAG AACGCTGGCC 

1551 GGACATCGTG TTCCACGAAA CGCGCGAACA TTCGGTCAAA CTCTAA 

This corresponds to the amino acid sequence <SEQ ID 512; ORF 134.ng>: 

gl34 .pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD lEKQRGISVA SSVMQFDYKD HTWLIiDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LEIiLDEVEDI LQIRCAPVTW PIQ1GKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIXKGIN NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATMRMVGPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELAEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

401 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 

451 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 513>: 

ml34 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAACTCTTGC 

101 TGTTTTCGGG CGCGATTCAG AGCGCGGGTA CGGTAAAAGG CAAGAAAACC 

151 GGCAAATTCG CCACTTCCGA CTGGATGGAA ATCGAGAAGC AGCGCGGCAT 
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201 TTCCGTGGCA TCAAGTGTGA TGCAGTTCGA TTACAAAGAC CACACCGTCA 

251 ACCTCTT6GA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

301 GTTTTAACCG CCGTGGACAG CGCATTAATG GTCATCGACG CGGCAAAAGG 

351 CGTGGAAGCG CAAACCATCA AGCTCTTAAA CGTCTGCCGC CTGCGCGATA 

401 CACCGATTGT TACGTTTATG AACAAATACG ACCGCGAAGT GCGCGATTCC 

451 CTGGAACTTT TGGACGAAGT GGAAAACATT TTAAAAATCC GCTGCGCGCC 

501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 

551 TCCTGAACGA TGAAATTTAT CTCTTTGAAG CTGGCGGCGA ACGCCTGCCG 

601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCTGAAT TGGAACAACG 

651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 

701 CGGCTTCCAA CGAGTTTAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 

751 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC AGGAAATCCT 

801 CAATTCATTG ATTGACTGGG CGCCCGCGCC GAAACCGCGC GACGCGACCG 

851 TACGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT TATCTTCAAA 

901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 

951 CGTCTGCTCC GGCAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTGG TTACCTTCAT GTCGCACGAC 

1051 CGCGAGCTGG TTGAAGAAGC CTACGCCGGC GACATTATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTCACCGG CATCCCATTC TTCGCACCCG AACTGTTCCG CAGCGTACGC 

1201 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGCT TGCAACAGCT 

1251 CGGCGAAGAA GGCGCGGTGC AGGTGTTCAA ACCGATGAGC GGCGCGGATT 

1301 TGATTTTGGG CGCGGTCGGC GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 

1351 CTCGCCAACG AATACGGCGT AGAAGCCGTG TTCGACAGCG CATCCATCTG 

1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCTGAATTTG 

1451 AAAAAGCCAA CGCGGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 

1501 TACCTCGCGC CCAACCGCGT GAATTTGGGA CTCACGCAAG AACGTTGGCC 

1551 GGACATCGTG TTCCACGAAA CACGCGAACA TTCGGTCAAA CTGTAA 

This corresponds to the amino acid sequence <SEQ ID 514; ORF 134>: 

ml34 .pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWME lEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVENI LKIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

401 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEWT SR 

451 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K ^gonorrhoeae 

ORF 134 shows 98.7% identity over a 531 aa overlap with a predicted ORF (ORF 134.ng) 
from N, gonorrhoeae: 

ml34/gl34 

ml34 .pep 
gl34 



ml34 .pep 
gl34 

70 80 90 100 110 120 

130 140 150 160 170 180 



10 20 30 40 50 60 

MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII: 
MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 
10 20 30 40 50 60 

70 80 90 100 110 120 

I EKQRGI SVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 

IIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIilllllllll 

lEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 
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QTIKLIiNVCRLRDTPIVTFMNKYDREVRDSLEIiIiDEVENILKIRCAPVTWPIGMGKNFKG 

lllllllllllllllllllllllllllllllllllllhlhllllllllllllllllll 
QTIKLLNVCRLRDTPIVTFMNKYDREVRDSLELLDEVEDILQIRCAPVTWPIGMGKNFKG 
130 140 150 160 170 180 

190 200 210 220 230 . 240 

VYHILNDEIYLFEAGGERLPHEFDIIKGIDNPELEQRFPLEIQQLRDEIELVQAASNEFN 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
VYHILNDEIYLFEAGGERLPHEFDIIKGINNPELEQRFPLEIQQLRDEIELVQAASNEFN 
190 200 210 220 230 240 

250 260 270 280 290 300 

LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 

IllllllllllllllllUlllllllllllllllllllllllhlll I II II I II MM 

LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATMRMVGPDEPKFSGFIFK 
250 260 270 280 290 300 

310 320 330 340 350 360 

IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 

II II 1 1 1 1 1 II I II 1 1 1 1 1 II II I M II II 1 1 M 1 1 II I II 1 1 1 1 II M M 1 1 M 1 1 1 1 1 

IQANNIDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSVVTFMSHDRELAEEAYAG 
310 320 330 340 350 360 

370 380 390 400 410 420 

DIIGIPNHGNIQIGDSFSEGEQIiAFTGIPFFAPELFRSVRIKNPLKIKQtiQKGLQQLGEE 

MMIIMMIMMIMMMIIMMMIMIMMIMMIMMIMMIMMII 

DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 
370 380 390 400 410 420 

430 440 450 460 470 480 

GAVQVFKPMSGADLILGAVGVLQFEVVTSRLANEYGVEAVFDSASIWSARWVSCDDKKKL 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIMIIIMIIII 

GAVQVFKPMSGADLILGAVGVLQFEWTSRLANEYGVEAVFDSASIWSARWVSCDDKKKL 
430 440 450 460 470 480 

490 500 510 520 530 

AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIII 

AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 
490 500 510 520 530 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 515>: 

al34.seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAACTCTTGC 

101 TGTTTTCAGG TGCGATTCAA AGCGCGGGTA CGGTAAAAGG CAAGAAAACC 

151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC AGCGCGGCAT 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTATAAAGAC CACACCGTCA 

25i ACCTTTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

301 GTTTTGACCG CCGTCGATAG TGCCTTGATG GTCATCGACG CGGCAAAAGG 

351 CGTGGAAGCG CAAACCATCA AACTCTTGAA CGTCTGCCGC CTGCGCAATA 

401 CGCCGATTGT TACGTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCC 

4 51 CTGGAATTGC TGGACGAAGT GG7\AAACATC CTGCAAATCC GCTGCGCGCC 

501 CGTAACCTGG CCGATCGGCA TGGGC AAAAA CTTCAAAGGC GTGTACCACA 

551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCTTGCCG 

601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCCGAAT TGGAACAACG 

651 CTTTCCGTTA GAAATACAGC AGTTGCGCGA CGAMTCGAA TTGGTGCAGG 

701 CGGCTTCCAA CGAGTTCAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 

751 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC AGGAAATCCT 

801 CAATTCATTG ATTGAATGGG CGCCCGCGCC GAAACCACGC GATGCGACCG 

851 TGCGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT TATCTTCAAA 

901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 



ml34 .pep 
gl34 

ml34 .pep 
gl34 

ml34 -pep 
gi34 

ml34 .pep 
gl34 

ml34 .pep 

gi34 

tnl34 .pep 

gi34 

ml34 .pep 

gi34 
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951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501. 
1551 



CGTCTGCTCC 
TCAACCGCGA 
CGCGAGCTGG 
CCACGGCAAC 
CGTTTACCGG 
ATCAAAAACC 
TGGCGAAGAA 
TGATTTTGGG 
CTTGCCAACG 
GTCGGCGCGC 
AAAAAGCCAA 
TACCTCGCCC 
GGACATCGTG 



GGCAAATTCG 
AATCGCCGCC 
TTGAAGAAGC 
ATCCAAATCG 
CATCCCATTC 
CGCTGAAAAT 
GGTGCGGTGC 
CGCGGTCGGC 
AATACGGCGT 
TGGGTATCGT 
CGCGGGCAAC 
CTAACCGCGT 
TTCCACGAAA 



AGCGCGGCAT 
TCCAGCGTGG 
CTACGCCGGC 
GCGACAGCTT 
TTCGCGCCCG 
CAAGCAACTG 
AGGTGTTCAA 
GTGTTGCAGT 
GGAAGCCGTG 
GCGACGACAA 
CTCGCCATCG 
GAATCTGGGA 
CGCGCGAGCA 



GAAAATGAAA 
TAACCTTCAT 
GACATTATCG 
CTCCGAAGGC 
AACTGTTCCG 
CAAAAAGGTT 
ACCAATGAGC 
TTGAAGTCGT 
TTCGACAACG 
GA/VAAAACTG 
ACGCGGGCGG 
CTCACGCAAG 
TTCGGTCAAA 



CACCTGCGTA 
GTCCCACGAC 
GTATCCCAAA 
GAACAACTGA 
CAGCGTTCGC 
TGCAACAGCT 
GGCGCGGATT 
TACCTCGCGC 
CATCCATCTG 
GCGGAATTTG 
CAACCTCGCC 
AACGCTGGCC 
CTTTAA 



This corresponds to the amino acid sequence <SEQ ID 516; ORF 134.a>: 

al34 .pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD lEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRNTPIVTFM NKYDREVRDS 

151 LELLDEVENI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL lEWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLTFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 

4 51 LANEYGVEAV FDNASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

ml34/al34 98.9% identity in 531 aa overlap 

10 20 30 40 50 60 

ml34 . pep MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 
I M I I I I I I I t I I I I I I I I I I I I I I I I I I I I t I i I I I I I I i M I I M M I i I I I I I I I I : 
al34 MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 

10 20 30 40 50 60 



70 80 90 100 110 120 

inl34 . pep lEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 
I I I i M I M I I I I I I t i I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I i I I I i I I 
al34 lEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml34 . pep QTIKLLNVCRLRDTP I VTFMNKYDREVRDSLELLDEVEN I LKIRCAPVTW PIGMGKNFKG 

I I I I I I I I M I I : I I I I i I 1 I I I I I I i I I I I I I I I I I I I I I : I I t I I 1 I I I I I I I I I I I I 
al34 QTIKLLNVCRLRNTPIVTFMNKYDREVRDSLELLDEVENILQIRCAPVTWPIGMGKNFKG 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml34 . pep VYHILNDEIYLFEAGGERLPHEFDIIKGIDNPELEQRFPLEIQQLRDEIELVQAASNEFN 
I I I I i i t I I I I i I I I I I I I I i I I I I I I I I I I I M I I I I I I I I I I I M I I I I i I I I I I I I I 
al34 VYHILNDEIYLFEAGGERLPHEFDIIKGIDNPELEQRFPLEIQQLRDEIELVQAASNEFN 

190 200 210 220 .230 240 

250 260 270 280 290 300 

inl34 . pep LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 
M I I I M I I I I I I I I I ! I I I I I I I I It I I I I :[ I M I 1 I I I I I I I I I I I I i I I I I I I t I I 
al34 LDEFLAGELTPVFFGSAINNFGIQEILNSLIEWAPAPKPRDATVRMVEPDEPKFSGFIFK 

250 260 270 280 290 300 

310 * 320 330 340 350 360 

ml34 . pep IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 

III I I II I I II I I II I I I I t I II I I I I I I M t I I II N I II I I II I I I I I I I I I 

al34 IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 
310 320 330 340 350 360 
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370 380 390 400 410 420 

nil34 .pep DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 
) I I I I I I I I I M I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I M I I I 
al34 DIIGIPNHGNIQIGDSFSEGEQLTFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml34 .pep GAVQVFKPMSGADLILGAVGVLQFEWTSRLANEYGVEAVFDSASIWSARWVSCDDKKKL 
I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I N : I I I I I I I I t I i I I I I I I 
al34 GAVQVFKPMSGADLILGAVGVLQFEWTSRLANEYGVEAVFDNASIWSARWVSCDDKKKL 

430 440 450 460 470 480 

490 500 510 520 530 

ml34 .pep AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 
I I I I I I I I I I M I I I i I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I M I I I 
al34 AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 

490 500 510 520 530 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ E) 517>: 

gl35,seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCG 

51 TTCGGAcgGC AGCCTCTCGC GCGGCAAAAT CCAAACCATC ACCCGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCTG CAGGGTTCGG CGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCACAAATC 

301 CTGCTCAGCC GTGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

351 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCGATTCCC ATCATCAATG 

401 AAAACGACAC GGTTTCGGTT GAGGAGTTGA AAATCGGCGA CAACGACACA 

451 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 

501 GACCGACATA GACGGTCTTT ACACCGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

601 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 

651 CAAAGCGGCA ACCATCGCCG CCX3AATCCGG CGTACCGGTG TAtATCTGTT 

701 CCTCACTCAA ACCCGATTCA TTGGCCGAAG CCGCCGAACA TCAGGCGGAC 

751 GGCTCGTTTT TCGTcccCcg tgCCAAAGGT TTGCGGACAC AGAAGCAATG 

801 GctggCGTTC TATTCcgaaa gcggGGgcag cgttTAtgtg gacgaaagtg 

851 cggaacacgc tTtgtccgaa caagggaaag cctgCTGA 

This corresponds to the amino acid sequence <SEQ ID 518; ORF 135.ng>: 

gl35.pep 

1 MKYKRIVFKV GTSSITRSDG SLSRGKIQTI TRQLAALHHA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAIP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLLT DI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIAAESGVPV YICSSLKPDS LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESGGSVYV DESAEHALSE QGKAC* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 519>: 

ml35 . seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCA 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC ACCTGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCGG CAGGGTTCGG TGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCGCAAATC 

301 CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

351 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCCGTCCCC ATCATCAATG 

4 01 AAAACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGACACA 

451 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 
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501 GACCGACATA GACGGTCTTT ACACGGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

601 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 

651 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG TATATCTGTT 

701 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CTGCCGAACA TCAGGCGGAC 

751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AGAAGCAATG 

801 GCTGGCGTTC TATTCCGAAA GCCGGGGCAG CGTTTATGTG GACGAAGGTG 

851 CGGAACACGC TTTGTCCGAA CAGGGGAAAA GCCTGCTGAT GTCGGGCATT 

901 GCCGGAATCG AAGGGCATTT TTCCCGTATG GACACCGTAA CCGTGTACAG 

951 CAAGGC7ACC AAACAGCCCC TGGGCAAAGG GCGCGTCCTG TTCGGCTCTG 

1001 CCGCCGCCGA AGACCTGCTC AAATCGCGTA AGGCGAAAGG CGTGTTCATC 

1051 CATCGGGACG ACTGGATTTC CATCACGCCC GAAATACGCC TGCTTCTGAC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 520; ORF 135>: 

ml35 .pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TCQLAALHHA GHE IiVLVSSG 

51 AVAAGFGAIiG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIAAESGVPV YICSSLKPDA LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESRGSVYV DEGAEHALSE QGKSLLMSGI 

301 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL KSRKAKGVFI 

351 HRDDWISITP EIRLLLTEF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 135 shows 97.6% identity over a 294 aa overlap with a predicted ORF (ORF 135.ng) 
from N. gonorrhoeae: 

ml35/gl35 

10 20 30 40 SO 60 

MKYKRI VFKVGTSS I THSDGSLSRGKI QTI TCQLAALHHAGHELVLVSSGAVAAGFGALG 

I II I II III Mill Ihll Mill I II II II IIIIIIIIIIMIIIIIIIIMIIIIII 

MKYKRIVFKVGTSSITRSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 
10 20 30 40 50 60 

70 80 90 100 110 120 

FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

Ml II II III II Mill MM IIM I MM I Mill Mill MM II III I II III II II 

FKKRPVKIADKQT^AAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 
70 80 90 100 110 120 

130 140 150 160 170 180 

S VLLQRRAVPI INENDTVSVEELKI GDNDTLSAQVAAMIQADLLVLLTDI DGLYTGNPNS 

lllllllhlllllllllllllllllllllllllllllllllllllllllllllllllll 

SVLLQRRAI P IINENDTVSVEELKI GDNDTLSAQVAAMI QADLLVLLTDI DGLYTGNPNS 
130 140 150 160 170 180 

190 200 210 220 230 240 

NPDAVRLDKI EHINHE 1 1 EMAGGSGSANGTGGMLTKI KAATI AAESGVPVYI CSSLKPDA 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIII: 
NPDAVRLDKI EHINHEIIEMAGGSGSANGTGGMLTKIKAATIAAESGVPVYICSSLKPDS 
190 200 210 220 230 240 

250 260 270 280 290 300 

LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 

MMIIMMMMMMMMMIIMMMM 1 1 1 1 1 1 h I II I II II 1 1 = 

LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESGGSVYVDESAEHALSEQGKACX 
250 260 270 280 290 

310 320 330 340 350 360 



ml35.pep 
gl35 

ml35 .pep 
gi35 

ml35 .pep 

gi35 

ml35 .pep 
gl35 

Tnl35 .pep 
gl35 
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ml3 5 . pep AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 521>: 

al35. seq 

1 ATGAAATACA AAAGAATCGT 

51 TTCGGACGGC AGTCTCTCGC 

101 TTGCCGCATT GCATCATGCG 

151 GCGGTTGCGG CAGGGTTCGG 

201 AATCGCCGAC AAACAGGCTT 

251 AAGAATATAC GGCAAACCTG 

301 CTGCTCAGCC GCGCCGACTT 

351 CGGCGCACTT TCCGTGCTGC 

401 AAAACGATAC GGTTTCGGTT 

451 TTGAGTGCGC AAGTGGCGGC 

501 GACCGACATA GACGGTCTTT 

551 CCGTACGGCT GGACAAAATC 

601 GCGGGCGGCT CGGGTTCGGC 

651 CAAAGCGGCG ACGATTGCGA 

701 CCTCGCTCAA ACCCGATGCA 

751 GGCTCGTTTT TCGTCCCCCG 

801 GCTGGCGTTC TATTCCGAAA 

851 CGGAACACGC TTTGTCCGAA 

901 GCCGGAATCG AAGGGCATTT 

951 CAAGGCAACC AAACAGGCTT 

1001 CCGCCGCCGA AGACCTGCTC 

1051 CATCGGGACG ACTGGATTTC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 522; ORF 135. a>: 

al35,pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TRQLAALHHA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIATESGVPV YICSSLKPDA LAEAADNQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESRGGVYV DEGAEHALSE QGKSLLMSGI 

301 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL KLRKAKGVFI 

351 HRDDWISITP EIRLLLTEF* 

ml35/al35 98.4% identity in 369 aa overlap 

10 20 30 40 50 60 

ml35 . pep MKYKRIVFKVGTSSITHSDGSLSRGKIQTITCQLAALHHAGHELVLVSSGAVAAGFGALG 

llttilMlillMIIIIIIIIIIIIIMII Mil I I I I I I t I I I I I I I I I I I I 

al35 MKYKRIVFKVGTSSITHSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml3 5 . pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 
I I I I I i I I I I 1 I I t I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I i I I I 
al35 FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ral35 . pep SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDI DGLYTGNPNS 

I I I I M I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al35 SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDIDGLYTGNPNS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml35 . pep NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIAAESGVPVYICSSLKPDA 
I I I I i I I I I I I I I I I I I I I t I I I I I I I I I i I I I I I I I M I I t I : I I I I n I I I I I I I M I 
al35 NPDAVRLDKIEHINHEIIEMAGGSGSTU^GTGGMLTKIKAATIATESGVPVYICSSLKPDA 

190 200 210 220 230 240 



ATTTAAAGTC 
GCGGCAAAAT 
GGACACGAGC 
TGCGCTGGGT 
CCGCCGCCGT 
TCTTCAGACG 
TGCCGACAAA 
TGCAACGCCG 
GAGGAATTGA 
GATGATACAG 
ACACCGGCAA 
GAACACATCA 
AAACGGCACA 
CCGAGTCCGG 
CTTGCCGAAG 
TGCCAAAGGT 
GCAGGGGCGG 
CAGGGAAAAA 
TTCCCGTATG 
TGGGCAAAGG 
AAATTGCGTA 
CATCACGCCC 



GGCACATCTT 
CCAAACCATC 
TGGTCTTGGT 
TTCAAAAAAC 
CGGGCAGGGG 
GCATCGTGTC 
CGCCGCTACC 
CGCCGTCCCC 
AAATCGGCGA 
GCAGACCTCT 
CCCGAACAGC 
ACCATGAAAT 
GGCGGTATGC 
CGTACCGGTC 
CGGCAGATAA 
TTGCGGACGC 
CGTTTATGTG 
GCCTGCTGAT 
GACACCGTAA 
GCGAGTCCTG 
AGGCGAAAGG 
GAAATACGCC 



CGATTACCCA 
ACCCGCCAGC 
GTCTTCCGGC 
GTCCGGTCAA 
CTGCTGATGG 
CGCACAAATC 
AAAATGCCGG 
ATCATCAATG 
CAACGACACA 
TGGTGCTGCT 
AATCCCGATG 
CATCGAAATG 
TGACTAAAAT 
TATATCTGTT 
TCAGGCGGAC 
AGAAGCAATG 
GACGAAGGTG 
GTCGGGCATT 
CCGTGTACAG 
TTCGGCTCTG 
CGTGTTCATC 
TGCTTCTGAC 



wo 99/57280 




PCT/US99/09346 



250 260 270 280 290 300 

ml35 . pep LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 
I I I I I :: I I I I M I I I I I I I I I I I I I I I I I I I I I I I : t t I I i I I I I I I I I I I I I I I I I i I 
a 1 3 5 LAE AADNQADGS FFVPRAKGLRTQKQWLAFY SESRGGVYVDEGAEHALSEQGKSLI^SG I 

250 260 270 280 290 300 

310 320 330 340 350 360 

ral35.pep AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 
I I I I t I I I I I t I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I t M I M I t I I I I I I I I 
al35 AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKLRKAKGVFIHRDDWISITP 

310 320 330 340 350 360 

370 

ml 35. pep EI RLLLTE FX 

I I I I I I I I i I 
.al35 EIRLLLTEFX 
370 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 523>: 

gl36 .seq 

1 ATGGAAATCC GGTTTCAGAC AGCATTTTTA CGTTTGGTTC AGatgaAAAC 

51 AAACGCTtca aTTCTtaccg caACACGCCT TGTATTTCCT GccgCTGCCG 

101 CACGGACAGG GATCGTTCCT GCCGgtTTTT TCCCCTTCCC TGCGGACGGT 

151 TTGCGGTTTG TTGATGACCG CCTGCCAGTA GCGGTAGATG TCtgccagcg 

201 cgTAAGGCag tTCGGAcgca agttccgcca gctcgccttc ggTGAATTGC 

251 AGgcggataa cgccgtttTC CTCTTCGTCg taaatgccgc ccactgccat 

301 cacgGGGTAA AACAGCTCTT CAAACGCTTC ATCATCGGCG GCTTCAAACC 

3 51 AATCGGTCGG CACAATGTCC AAACCGTAAA GATAGGCGTT GCACCAAGTG 

4 01 TAAAAATCGC TGCCGCCCTC GCCGTCGTCG TAGAGCCACA AATCGGGCAG 
451 CTTTTTATCC GACATCGCGG CGGTTGTTTC CATCGCCATT GCCAAAACCA 
501 GCCGTTCGAT TTCGGAACGT TCGGCGGCGG TAAATTGCGA TTCGTCGCCC 
551 AACACTTC6G GCAGCCAGTC GAGCGGTGCC AATTTGTCCG GCCCGCTCAA 
601 CAGCGCCGTC ATAAAACCTT GAACCTCGTC GCAACGCATC GTGTTGCCTT 
651 GTTCGCTTTT GGCATCCAAT AA 

This corresponds to the amino acid sequence <SEQ ID 524; ORF 136.ng>: 

gl36 .pep 

1 MEIRFQTAFL RLVQMKTNAS ILTATRLVFP AAAARTGIVP AGFFPFPADG 

51 liRFVDDRLPV AVDVCQRVRQ FGRKFRQLAF GELQADNAVF LFWNAAHCH 

101 HGVKQLFKRF IIGGFKPIGR HNVQTV KIGV APSVKIAAAL AVW EPQIGQ 

151 LFIRHRGGCF HRHCQNQPFD FGTFGGGKLR FVAQHFGQPV ERCQFVRPAQ 

201 QRRHKTLNLV ATHRVALFAF GIQ* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 525>: 

ml36 .Beq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCGTTCCCTG 

101 CGGACGGTTT GCGGTTTGTT GATGACTGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTCCAG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGCG CCGTTTTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACX3GC 

301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAGCATTGC 

351 ACCATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAGCCACAAA 

401 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 

451 CAAAACCAGC CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 

501 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCAA TTTGTCCGGC 

551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 

601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 

651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CAAATGGGTT 

701 TTGCGCCCTA TTATCGCCGC AATGCCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 526; ORF 136>: 



wo 99/57280 




PCT/US99/09346 



ml36 .pep 

1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRFV DDCLPVAVDI 

51 RQCIRQLGFQ FRQLAFCELQ TDSAVFLFW NTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TV KISIAPCV KIAAAVFV FI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF QMGFAPYYRR NAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 136 shows 85.6% identity over a 209 aa overlap with a predicted ORF (ORF 136.ng) 
from K gonorrhoeae: 

ml36/gl36 

10 20 30 40 

ml3 6 . pep METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPV 

hIMIIIIIIIIII Hlllllllll M lllllllllll III 

g 1 3 6 ME I RFQTAFLRLVOMKTNAS I LTATRLVFPAAAARTGI VPAGFFP FPADGLRFVDDRL PV 

10 20 30 40 50 60 

50 60 70 80 90 100 

ml3 6 . pep AVDIRQCIRQLGFQFRQLAFCELQTDSAVFLFWNTAQCHDGIKQLFKRFIIDGFKPIGR 
Ilh I :||:| :|lllll I I h h I I I I I I II : h I I hlllllllll IMIII! 
g 1 3 6 AVDVCQRVRQFGRKFRQIiAFGELQADNAVFLFWNAAHCHHGVKQLFKRF 1 1 GGFKP I GR 

70 80 90 100 110 120 

110 120 130 140 150 160 

ml3 6 . pep HNIQTVKISIAPCVKIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQKTQPFDFGTFGGGKLR* 

IhllllhHI Illllh I =:|lllhlllllllllllllllllllllllllllll 

gl3 6 HNVQTVKIGVAPSVKIAAAIiAVWEPQIGQLFIRHRGGCFHRHCQNQPFDFGTFGGGKLR 
130 140 150 160 170 180 

170 180 190 200 210 220 

tnl3 6 . pep FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIH 

lllllllllllllllllllllllllllllllllllllllllll 
gl3 6 FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGIQX 
190 200 210 220 

230 240 
ml 3 6 . pep HFPFQMGFAPY YRRNAVX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 527>: 

al36.seq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGCTTGTT GATGACCGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTCCAG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGTG CCGTTGTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAGCATTGC 

351 ACCATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAGCCACAAA 

401 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 

451 CAAAACCAGC CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 

501 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCM TTTGTCCGGC 

551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 

601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 

651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CCAATGGGTT 

701 TTGCGCCCTA TTATAGTGGA TTAAATTTAA ATCAGGACAA GGCGACGAAG 

751 CCGCAGACAG TACAAATAGT ACGGCAAGGC GAGGCAACGC CGTACTGGTT 

801 TAAATTTAAT CCACTATATC GCCGCAATGC CGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 528; ORF 136.a>: 

al36.pep 
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1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRLV DDRLPVAVDI 

51 RQCIRQLGFQ FRQ LAFCELQ TDSAWLFW NTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TVK I5IAPCV KIAAAVFVFI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF PMGFAPYYSG LNLNQDKATK 

251 PQTVQIVRQG EATPYWFKFN PLYRRNAV* 

ml 36/al 36 98.3% identity in 238 aa overlap 

10 20 30 40 50 60 

ml36.pep METNAS I LTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPVAVD I RQCIRQLGFQ 

I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I t I I I I i I I I : i I I I I I I I I I I I I I I I I I I I 
al36 METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRLVDDRLPVAVDIRQCIRQLGFQ 
10 20 30 40 50 60 

70 ■ 80 90 100 110 120 

ml 3 6. pep FRQLAFCELQTDSAVFLFWNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 
I I I I I I t I I I I I I I I I I i I I I I I I I I I I I I i I I I I It I I I I I I I I I I I I I I I I I I I I I I 
al36 FRQLAFCELQTDSAWLFWNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 

70 80 90 100 110 120 

130 140 150 160 170 180 

inl36.pep KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 
I I I I I I I I [ M I I I I i I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I i I I I M 
al36 KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml3 6 . pep FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFQMGFAPYYRR 

I I I I M I I I I I t I I I I I I I I I I I I I I I I i I I I I I I I I I { I I I 1 I I t I I I I I I I I I I I 
al36 FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFPMGFAPYYSG 

190 200 210 220 230 240 



ml 3 6. pep NAVX 

al36 LNLNQDKATKPQTVQIVRQGEATPYWFKFNPLYRRNAVX 
250 260 270 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 529>: 

gl37 . seq 

1 ATGATTATCC ATCACcaaTT CGATCCCGTC CTCATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCT TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TGATTTTGGG 

201 CGGACGCTTG GGCTATGTCC TGTTTTACAA ATTCTCCGAC TACCTCGCCC 

251 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATATGG TTGTTCAGCC GCAAGCACGG 

351 CATCGGCTTC CTCAAACTGA TGGACACGGT CGCGCCGCTC GTTCCGCTGG 

401 GTCTCGCTTC GGGACGTATC GGCAACTTTA TCAACGGCGA ACTTTGGGGA 

451 CGCATTACCG ACATTAACGC ATTTTGGGCA ATGGGCTTCC CGCAAGCGCA 

501 TTACGAAGAT GCCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 

551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 

601 GCCCTTGAAG GCATCTGCCT GTTCGCCGTC GTTTGGCTGT TTTCCAAAAA 

651 ACCGCGCCCG ACCGGGCAGA CTGCCGCGCT TTTTCTCGGC GGCTACGGCG 

701 TGTTCCGCTT TATTGCCGAA TTTGCGCGCC AACCCGACGA CTATCTCGGG 

751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 

801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 

851 GA 

This corresponds to the amino acid sequence <SEQ ID 530; ORF 137,ng>: 

gl3 7 .pep 
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1 MIIHHQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR lAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGG MSFHG 

101 GFLGWIAIW LF SRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RITDINAFWA MGFPQAHYED AEAAAHNPLW AEWLQQYGML PRHPSQ LYQF 

201 ALEGICLFAV VWL FSKKPRP TGQTAALFLG GYGVFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 531>: 

ml37.seq 

1 ATGATTACCC ATCCCCAATT CGATCCCGTC CTTATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCC TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TAATTTTGGG 

201 CGGGCGTTTG GGTTACGTCC TGTTTTACAA GTTTTCCGAC TACCTCGCCC 

251 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATACGG TTGTTCGGCC GCAAACACGG 

351 CATCGGCTTC CTCAAACTGA TGGATACGGT CGCACCGCTC GTTCCGCTGG 

401 GTCTCGCTTC GGGACGTATC GGCAACTTCA TCAACGGCGA ACTTTGGGGA 

4 51 CGCGTTACCG ACATCAACGC ATTTTGGGCA ATGGGCTTCC CGCAGGCGCG 

501 TTACGAAGAT GCCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 

551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 

601 GCACTTGAAG GCATCTGCCT GTTCACCGTC ATTTGGCTGT TCTCTAAAAA 

651 ACAGCGGTCG ACCGGACAAG TCGCCTCGCT CTTCCTCGGC GGCTACGGCA 

701 TATTCCGCTT CATTGCCGAA TTCGCACGCC AACCCGACGA CTATCTCGGG 

751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 

801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 

851 GA 

This corresponds to the amino acid sequence <SEQ ID 532; ORF 137>: 
ml37.pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR lAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEG GMSFHG 

101 GFLGWIAIR LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDINAFWA MGFPQARYED AEAAAHNPLW AEWLQQYGML PRHPS QLYQF 

201 ALEGICLFTV IWL FSKKQRS TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFGM K KQH* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 137 shows 95.4% identity over a 283 aa overlap with a predicted ORF (ORF 137.ng) 
fi-om N. gonorrhoeae: 

ml37/gl37 

10 20 30 40 50 60 

ml3 7.pep 
gl37 



ml37 .pep 
gl37 



ml37 .pep 
gl37 



tnl37 . pep 



MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

II I Mlllllllllllllllllilllllllllllllllllllllllllllllllllll 
MIIHHQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
10 20 30 40 50 60 

70 80 90 100 110 120 

I LGVI LGGRLGYVLFYKFSDYLAHPLDI FKVWEGGMSFHGGFLGWI AI RLFGRKHGIGF 

IIIIMMIIIillMllllllllllllllllllllllllllllillll IhllMIII 

ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIWLFSRKHGIGF 
70 80 90 100 110 120 

130 140 150 160 170 180 

LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 

Mllllllllllllllllllllllllllllhlllllllllllllhlllllllllllll 

LKLMDTVAPLVPLGLASGRIGNFINGELWGRITDINAFWAMGFPQAHYEDAEAAAHNPLW 
130 140 150 160 170 180 

190 200 210 220 230 240 

AEWLQQY04LPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 
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gl37 AEWLQOYGMLPRHPSQLYQFALEGICLFAWWLFSKKPRPTGQTAALFLGGYGVFRFIAE 
190 200 210 220 230 240 

250 260 270 280 

ml37 . pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

llllllllllllllllllllllllllllllllllilllllllll 
gl3 7 FARQPDDYLGLLTLGLSMGQWLSVPMI VLGIVGFVRFGMKKQHX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 533>: 

al37 .seq 

1 ATGATTACCC ATCCCCAATT CGACCCCGTC CTTATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCC TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TAATTTTGGG 

201 CGGGCGTTTG GGTTACGTCC TGTTTTACAA GTTTTCCGAC TACCTCGCCC 

251 ATCCGCTTGA TATTTTCAAG GTATGGG/U^G GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATATGG TTGTTCGGTC GCAAACACGG 

351 CATCGGCTTC CTCAAACTGA TGGACACGGT CGCACCGCTC GTTCCACTGG 

4 01 GTCTCGCTTC GGGACGTATC GGCAACTTCA TCAACGGCGA ACTTTGGGGA 

451 CGCGTTACCG ACATCAACGC ATTTTGGGCA ATGGGCTTCC CGCAGGCGCG 

501 TTACGAAGAC CTCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 

551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 

601 GCACTTGAAG GCATCTGCCT GTTCGCCGTC GTTTGGCTGT TCTCTAAAAA 

651 ACAGCGGCCG ACCGGACAAG TCGCCTCACT CTTCCTCGGC GGCTACGGCA 

701 TATTCCGCTT CATTGCCGAA TTTGCACGCC AACCCGACGA CTATCTCGGG 

751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 

801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 

851 GA 



This corresponds to the amino acid sequence <SEQ E) 534; ORF 137.a>: 

al37 .pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILETFLG RRR lAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGG MSFHG 

101 GFLGWIAIW LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDINAFWA MGFPQARYED LEAAAHNPLW AEWLQQYGML PRHPSQ LYQF 

201 ALEGICLFAV VWL FSKKQRP TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

ml37/al37 98.2% identity in 283 aa overlap 

10 20 30 40 50 60 

ml37,pep MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I i I i I I I I I I I I I I I I t I I I I I I I I 
al37 MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml37 . pep ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIRLFGRKHGIGF 
I I I I I I I i 1 I I I I I I I I I I i 1 M I I I i I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I 
al37 ILGVILGGRLGYVLFYKFSDYL;U^PLDIFKVWEGG^5SFHGGFLGWIAIWLFGRKHGIGF 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml37 , pep LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 
I I i I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I 1 I I I I I I 
al37 LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDLEAAAHNPLW 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml37 . pep AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 
I I t I I I I I I I I I I I I I I I I I I I I { I I I I : I : I I I I I I I I I I I I M I I I I [ I I I 1 I I I I I 
al37 AEWLQQYGMLPRHPSQLYQFALEGICLFAWWLFSKKQRPTGQVASLFLGGYGIFRFIAE 
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190 200 210 220 230 240 

250 260 270 280 

ml37 . pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 
I I i I I I I I I I I I I I M I I I I I I I I I I I I I I I I ) I I I I I I I I I I i 
al37 FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 
250 260 270 280 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 535>: 
gl38.seq 

1 ATGGAGTTTG AAAACATTAT TTCCGCCGCc gaCAAGGCGC GTATCCTTGC 

51 CGAAGCACTG CCTTACAtCC gccgGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AGTATGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAA7UUVGG 

251 GCGAATTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGAC GATGGATATT 

301 GTCGAAATGG TATTGGGCGG GCACGTCAAC AAGGAAATCG TGTCGATGAT 

351 TAACACATAT GGAGGGCACG CGGTCGGCGT GAGC6GGCGC GACGACCATT 

401 TCATTAAGGC GAAGAAACTT TTGGTCGATA CGCCCGAACA GAATAGCGTG 

451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGCGGCT GCATTCCCGT CGTCGCCCCC GTCGGCX3TAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT GGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAAtatcgc 

651 cgGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC acgCCGAAAC 

701 GGATTGATGG GCTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCcgccgtc aACGGTGTGA AAGCCACGCA 

801 CATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGT^ ATCTTTACCG 

851 ATGCCGGTAT CGGGTCGATG ATTTTAGGCA GAGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 536; ORF 138.ng>: 

gl38.pep 

1 MEFENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 

51 RD WLLKLVG IHPVIVHGGG PQINAMLEKV GKKGEFVQGM RVTDKETMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LVDTPEQNSV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDGLIA DGTLYGGMLP 

251 KIASAVEAAV NGVKATHIID GRLP NALLLE IFTDAGIGSM ILG RGEDA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 537>: 

ml38 . seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

401 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 

701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGTGTGA AAGCCACGCA 

801 TATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 538; ORF 138>: 

ml38 .pep 

1 MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 
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51 RD WLLKLVG IHPVIVHGGG PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 liAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDELIA DGTLYGGMLP 

251 KIASAVEAAV NGVKATHIID GRLP NALLLE IFTDAGIGSM IL GGGEDA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K ^gonorrhoeae 

ORF 138 shows 98.0% identity over a 298 aa overlap with a predicted ORF (ORF 138.ng) 
from N, gonorrhoeae: 

ml38/gl38 

10 20 30 40 50 60 

MESENIISAADKARILAEALPyiRRFSGSVAVIKyGGNAMTEPALKEGFARDWLLKLVG 

II IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 

MEFENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 
10 20 30 40 50 60 

70 80 90 100 110 120 

IHPVIVHGGGPQINANn^EKVGKKGEFVQGMRVTDKEAMDIVEMVUSGHVNKEIVSMINTY 
MlllllllllllllllllllllllllllllllllhllllllinillllllllMIII 
IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKETMDIVEMVLGGHVNKEIVSMINTY 
70 80 90 100 110 120 

130 140 150 160 170 180 

GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVESIDTGLVKGLIERGCIPWAP 

llllllllllllllllllllhlllllhlllllllllllllllllllllllllllllll 

GGHAVGVSGRDDHFIKAKKLLVDTPEQNSVDIGQVGTVESIDTGLVKGLIERGCIPWAP 
130 140 150 160 170 180 

190 200 210 220 230 240 

VGVGEKGEAFNINADLVAGKIiAEELNAEKLLM^mTIAGVMDKTGNLLTKLTPKRIDELIA 

llllllllllllllllllllllllllllllllllllllllllllllllllllllll III 

VGVGEKGEAFNINADLVAGKLAEELNAEKLLM^rmIAGVMDKTGNIJJTKLTPKRITCL 

190 200 210 220 230 240 

250 260 270 280 290 299 

DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALIiLEIFTDAGIGSMILGGGEDAX 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII Mill 

DGTIiYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALLLEIFTDAGIGSMILGRGEDAX 
250 260 270 280 290 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 539>: 

al38,seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

401 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 

701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGCGTGA AAGCCACGCA 

801 TATCATCGAC GGCAGGGTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 



ral38 .pep 
gl3B 

ml38.pep 
gl38 

ml38 .pep 
gl38 

ml38 .pep 
gi38 

ml38.pep 
gi3B 
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This corresponds to the amino acid sequence <SEQ ID 540; ORF 138.a>: 

al38 .pep 

1 MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM 

51 RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF 

201 LAEELNAEKL LMMTNIAGVM DKTGN LLTKL TPKRIDELIA 

251 KIASAVEAAV NGVKATHIID GRVPNALLLE IFTDAGIGSM 



ml 38/al38 99.7% identity in 298 aa overlap 

10 20 



30 



40 



TEPALKEGFA 
RVTDKEAMDI 
LIDTPEQNGV 
NINADLVAGK 
DGTLYGGMLP 
ILGGGEDA* 



50 



60 



ral38 . pep MESENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 
I M I I I I I I I I I I I I I M I I I I I 1 I I I I I i I I I I I I I I I I I I I I I I I ] I I I I I ] I I I I I I 
al38 MESENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml38 . pep IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKEAMDIVEMVLGGHVNKEIVSMINTY 
I i M M I I I I I I I i i I I I I I I I I I I I I I I I I I I I I I I I I j I I I I I I I I I I I I I I I I I I I I 
al38 IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKEAMDIVEMVLGGHVNKEIVSMXNTY 

70 80 90 100 110 120 

130 140 150 160 170 180 

inl38.pep GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVESIDTGLVKGLIERGCIPWAP 

I I I I I I M ] I it I I I I I I I I I M I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I 

al38 GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVESIDTGLVKGLIERGCrPWAP 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml38 . pep VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTNIAGVMDPCTGNLLTKLTPKRI DELIA 

I I M f I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M j I I I I I I M I I I I 
al38 VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTNIAGVMDKTGNLLTKLTPKRIDELIA 

190 200 210 220 230 240 

250 260 270 280 290 299 

ml38.pep DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALLLEIFTDAGIGSMILGGGEDAX 

IMIIIIIMIIIIIIIIIII|||||||||||:||Mlli]l Illlllll 

al38 DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRVPNALLLEIFTDAGIGSMILGGGEDAX 

250 260 270 280 290 



The following partial DNA sequence was identified in Kgonorrhoeae <SEQ ID 541>: 

gl3 9 .seq 

1 ATGCGAACCA CCTCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAggc ggcggcggag 

101 gcGGCACTTC TGCTCCCGAC TTTAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACGATAGC GGAATCAGCA GCAGTATCTT ACGCCGGTAT 

201 AAAAAACQAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAAAGCCCC CCGAATCTGC 

301 ATACCGGAGA CTTTTCAAAC CCAAATGACC AATATTAAGA ATATGATCAA 

351 CCTCAAACCT GCAATTGAAG CAGGCTATAC AGGACGCGGG GTAGAGGTAG 

401 GTATCGTCGA TACAGGCGAA TCCGTCGGCA GCATATCCTT TCCCGAACTG 

451 TATGGCAGAA AAGAACACGG CTATAACGAA AATTACAAAA ACAAATTACA 

501 AAAACTATAC GGCGTATATG CGGAAGGAAG CGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 542; ORF 138.ng>: 

gl3 9 .pep 

1 MRTTSTFPTK TFKPAAMA LA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATIAESA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKIKAPRIC 

101 IPETFQTQMT NIKNMINLKP AIEAGYTGRG VBVGIVDTGE SVGSISFPEL 

151 YGRKEHGYNE NYKNKLQKLY GVYAEGSA* 
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The following partial DNA sequence was identified in Kmeningitidis <SEQ ID 543>: 

ml39 . seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGACTGCCAT 

51 GGOGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGTACCGG TATCGGCAGC 

151 AACAGCAGAG CAACAACAGC GAAATCAGCA GCAGTATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTCCA AACCCAAATG ACGCATtACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

401 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

451 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTACG AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 544; ORF 138>: 

ml39.pep 

1 MRTTPTFPTK TFKPTAMA LA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 

101 CIPETFQTQM THYKNIjINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 

151 LYGRKEHGYN ENYEKLYGVY AEGSA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF firom Kf^onorrhoeae 

ORF 138 shows 92.2% identity over a 179 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae: 

ral39/gl39 
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10 20 30 40 50 60 

ml3 9 . pep MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

nil llllllllhlllllllllllllMIIIIIIIIIIIIIIIIIIIIIIIII hi! 

gl3 9 MRTTSTFPTKTFKPAAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATIAESA 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml3 9 . pep AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 

lllllllllllllllMllllllllllllllllhll lllllllllllll: Ihllll 
gl3 9 AVS YAGI KNEMCKDRSMLCAGRDDVAVTDRDAKIKAP -RI CI PETFQTQMTNI KNMINLK 

70 80 90 100 110 



130 140 150 160 170 
ml 3 9 . p ep PAI EAGYTGRGVEVGIVDTGESVGS I S FPELYGRKEHGYNENY EKL YGVYAEGSAX 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII :|lllllilllll 
gl3 9 PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYKNKLQKLYGVYAEGSAX 
120 130 140 150 160 170 



The following partial DNA sequence was identified in meningitidis <SEQ ID 545>: 

al39.seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACAACAGC GAAATCAGCA GCAATATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTACA AACCCAAATG ACGCAT.ACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

4 01 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

451 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTAC. AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 546; ORF 139.a>: 

al39 .pep 

1 MRTTPTFPTK TFKPAAMALA VATTLSA CLG GGGGGTSAPD FNAGGTGIGS 
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51 NSRATTAKSA AISYAGIKKE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 
101 CIPETLQTQM THXKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 
151 LYGRKEHGYN ENYXKLYGVY AEGSA* 

ml39/al39 97.1% identity in 175 aa overlap 

10 20 30 40 50 60 

ml39 . pep MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 
I t I I I i I ! I i I I I I : 1 I I I I i I I 1 I I I I I I I I I I I I I I I I I I I I I ! I I I I I M I I I I I I I 
al39 MRTTPTFPTKTFKPAAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml39 . pep AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 
I : I I I I I I I 1 I i I I I I I i I I I I I I I I I I I I I I I I I I I I I i I I I I I : I I I I i I I I I I I I I 
al39 AISYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETLQTQMTHXKNLINLK 

70 80 90 100 110 120 

130 140 150 160 170 

ml39 . pep PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYEKLYGVYAEGSAX 
I I I I I I I I I I I i I I I I i I I I I I I I I 1 I i I I 1 I I I I I I I I I I I I I I I I I I I I i I I I 
al39 PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYXKLYGVYAEGSAX 

130 140 150 160 170 



The following partial DNA sequence was identified in Kgonorrhoeae <SEQ ID 547>: 

gl40.seq 

1 Atgtcggcac gCGGCAAGGG GGCAGgctat ctcAACAGTA CCGGACGACa 

51 TGTTCCCTTC CTGAGTGCCQ CCAAAATCGG GCAGGATTAT TCTTTCTTCA 

101 AAAATATCT^ AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCCG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CQTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAGCTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

351 GGTCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCTACGG CGCACAACTT 

401 TCCGCACAGC GGCAGCCGTA CAGCATGCGA ATACCGCCGA CGGCGTACGC 

451 aTCTTcaaCA GTCTCGCCGC TAccgTCTAt GccgACAGTG CCGCCGCCCA 

501 TGccgATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

551 ACAACGGTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT CGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 TATCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

701 TGGGCATAGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAJ^AACC 

751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGTGG GCGATATCGG 

801 CTATCTCAAA GGCCTGTTCT CctaCGGACG CTACAAAAAC AGCATCAGCC 

851 GCAGCACCGG TGCGGATGAA TATGCGGAAG GCAGCGTCAA CGGCACGCTG 

901 ATGCAGCTGG GCGCACTGGG TGGTGTCAAC GTTCCGTTTG CCGCAACGGG 

951 AGATTTGACG GTTGAAGGCG GTCTGCGCCA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCagt GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAACTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTCTG CGACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CGCGGTAACG GGCGGCTTTA CCGGCGCGGC TGCAGCAACC 

1201 GGCAAGACGG GTGCACGCAA TATGCCGCAC ACCCGCCGGG TTGCCGGTCT 

1251 GGGGGTGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

1301 GCTACACCGG TTCCAAACAG TACGGCAACC ACAGCGGACA AATCGGCX5TA 

1351 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ED 548; ORF 140.ng>: 

gl40 .pep 

1 MSARGKGAGY LNSTGRHVPF LSAAKIGQDY SFFKNIKTDG GLLASLDSVE 

51 KTAGSEGDTP SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 

101 ELDASESSAT PETVETAVAD RTDMPGIRLR RTTFRTAAAV QHANTADGVR 

151 IFNSLAATVY ADSAAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 

201 TWEQGGVEGK MRGSTQTIGI AAKTGENTTA AATLGIGRST WSENSANAKT 
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251 DSISLFAGIR HDVGDIGYLK GLFSYGRYKN SISRSTGADE YAEGSVNGTL 
301 MQLGALGGVN VPFAATGDLT VEGGLRHDLL KQDAFAEKGS ALGWSGNSLT 
351 EGTLVGLAGL KLSQPLSDKA VLSATAGVER DLNGRDYAVT GGFTGAAAAT 
401 GKTGARNMPH TRRVAGLGVD VEFGNGWNGL ARYSYTGSKQ YGNHSGQIGV 
451 GYRF* 

The following partial DNA sequence was identified in N.meningitidis <SEQ ID 549>: 

ml40 . seq 

1 ATGTCGGCAC GCGGCAAGGG GGCAGGCTAT CTCAACAGTA 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT 

101 CAAACATCGA AACCGACGGC GGCCTGCTGG CTTCCCTCGA 

151 AAAACAGCGG GCAGTGAAGG CX3ACACGCTG TCCTATTATG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA 

301 GAACTGGATG CCTCC6AATC ATCCGCAACA CCCGAGACGG 

351 GGCAGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC 

401 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA 

451 ATCTTCAACA GTCTCGCCGC TACCGTCTAT GCCGACAGTA 

501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC 

551 ACAACGGCAC GGGTCTGCGC GTCATCGCGC AAACCCAACA 

601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA 

651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA 

701 TGGGCATGGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA 

751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG 

801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC 

851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA 

901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG 

951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA 

1051 GAAGGCACGC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC 

1101 CX5ATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC 

1201 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGTCTGG 

1251 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG 

1301 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG 

13 51 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ED 550; ORF 140>: 

ml40 .pep 

1 MSARGKGAGY LNSTGRRVPF LSAAKIGQDY SFFTNIETDG GLLASLDSVE 
51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 
101 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRAAAAV QHANAADGVR 
151 IFNSLAATVY ADSTAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 
201 TWEQGGVEGK MRGSTQTVGI AAICTGENTTA AATLGMGRST WSENSANAKT 
251 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 
301 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSLT 
351 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 
401 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 
451 GYRF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K^onorrhoeae 

ORF 140 shows 94.5% identity over a 454 aa overlap with a predicted ORF (ORF 140.ng) 
from N, gonorrhoeae: 

ml40/gl40 

10 20 30 40 50 60 

MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 

IIIIIIIIIIMIIIhlllllllllMIIIII Ihlllllllillllllllllllll 

MSARGKGAGYLNSTGRHVPFLSAAKIGQDYSFFKNIKTDGGLLASLDSVEKTAGSEGDTP 
10 20 30 40 50 60 

70 80 90 100 110 120 



CCGGACGACG 
TCTTTCTTCA 
CAGCGTCGAA 
TCCGTCGCGG 
CCCGCCGGTC 
CCTGATGGTC 
TTGAAACTGC 
GGCGCAACTT 
CGGTGTACGC 
CCGCCGCCCA 
GGGTTGGACC 
GGACGGTGGA 
GTACCCAAAC 
GCCGCCACAC 
TGCAAAAACC 
GCGATATCGG 
AGCATCAGCC 
CGGCACGCTG 
CCGCAACGGG 
AAACAGGATG 
CAGCCTCACT 
AACCCTTGAG 
GACCTGAACG 
TGCAGCAACC 
TTGCCGGCCT 
GCACGTTACA 
AGTCGGCGTA 



ml40 .pep 
gl40 
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ml 4 0 . pep S YYVRRGNAARTAS AAAHSAPAGLKHAVEQGGSNLENLMVELDASES SATPETVETAAAD 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllhll 
gl4 0 S YWRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENIiMVELDASESSATPETVETAVAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml4 0 . pep RTDMPGIRPYGATFRAAAAVQHANAADGVRI FNSLAATVYADSTAAHADMQGRRLKAVSD 

lillllll Hlhlllllllhlllllllllllllllllhllllllllllllllll 
gl4 0 RTDMPGIRLRRTTFRTAAAVQHANTADGVRIFNSLAATVYADSAAAHADMQGRRLKAVSD 
130 140 150 160 170 180 



190 200 210 220 230 240 

ml 4 0 . pep GLDHNGTGLRVI AQTQQDGGTWEQGGVEGK>mGSTQTVGI AAKTGENTTAAATLGMGRST 

llllllllllllllllllllllllllllllllllllhllllllllllllllllhllll 
gl40 GliDHNGTGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTIGIAAKTGENTTAAATLGIGRST 
190 200 210 220 230 240 



250 260 270 280 290 300 

ml40 .pep WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 

IIIMIIIIIIIIIIIIIIIIhllllllllllllllllllllllllllhlllllllll 

gl40 WSENSANAKTDSISLFAGIRHDVGDIGYLKGLFSYGRYKNSISRSTGADEYAEGSVNGTL 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml40 . pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 

lllllllllllllllllllll|||||:|||IIIIIIIIIIIMIIIIIIIIIIII||||| 
gl40 MQLGALGGVNVPFAATGDLTVEGGLRHDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 
310 320 330 340 350 360 



370 380 390 400 410 420 

ml4 0 . pep KLSQPLSDKAVLFATAGVERDl^GRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
Illlllllllll lllllllllllllhlllllllhllllllllllMIII llllhl 
gl40 KLSQPLSDKAVLSATAGVERDLNGRDYAVTGGFTGAAAATGKTGARNMPHTRRVAGLGVD 

370 380 390 400 410 420 



430 440 450 

ml40 . pep VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX 

IIMMIIIIIIII|:||||||||||::|||IMI 
gl4 0 VEFGNGWNGLARYSYTGSKQYGNHSGQIGVGYRFX 
430 440 450 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 551>: 

al40, seq 

1 ATGTCGGCAG GCGGTAAGGG GGCAGGCTAT CTCAACCGTA CCGGACAACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCGGGATTAT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GTAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301" GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

351 GGCCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 

4 01 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 

451 ATCTTCAACA ATCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 

501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

551 ACAACGCTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

701 TGGGCATGGG ACACAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 

751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 

801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 

851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 

901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
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951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCATCACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

1201 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGCCTGG TTGCCGGTCT 

1251 GGGCGCGGAT GTCGAATTCG GCMCGGCTG GAACGGCTTG GCACGTTACA 

1301 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA 

1351 GGCTACCGGT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 552; ORF 140.a>: 

al40.pep 

1 MSAGGKGAGY LNRTGQRVPF LSAAKIGRDY SFFTNIETDG GLLASLDSVE 
51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 
101 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRATU^V QHANAADGVR 
151 IFNNLAATVY ADSTAAHADM QGRRLKAVSD GLDHNATGLR VIAQTQQDGG 
201 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGHST WSENSANAKT 
251 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 
301 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSIT 
351 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 
401 GKTGARNMPH TRLVAGLGAD VEFGNGVfNGL ARYSYAGSKQ YGNHSGRVGV 
451 GYRF* 

ml40/al40 98.2% identity in 454 aa overlap 

10 20 30 40 50 60 

ml 40. pep MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 
III I I I I I II I I I : I M I I I I I II I : I I I II I I I I I! I I I I I I I II 11 M I 1 I I I I II 
al4 0 MSAGGKGAGYLNRTGQRVPFLSAAKIGRDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 

10 20 30 40 50 60 

70 . 80 90 100 110 120 

ml 4 0 , pep SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD 
I I I I t I I I t I I I I I I I I I I II I M I M i I I 1 I I I I M I I I I I I I I I I II I I I I I I I i I I I 
al 4 0 SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml40 . pep RTDMPGIRPYGATFRAAAAVQHANAADGVRIFNSLAATVYADSTAAHADMQGRRLKAVSD 
I I I I t I I I I I I I I I I I I I I i I I I 1 I II I I I I I I : I I I I I I I I I I I I II I I I I II I I I I I I 
al4 0 RTDMPGIRPYGATFRAAAAVQHANAADGVRIFNNLAATVYADSTAAHADMQGRRLKAVSD 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 4 0 . pep GLDHNGTGLRVI AQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGRST 

I I I I I ' I I I I M I I I I I I I I I I I I I II I I I I I I I II I I II I I II I I II I I I M M M : I I 
al 4 0 GLDHNATGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGHST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml40 . pep WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 
I I I I I I II I I I I I I I I I II I M I I II I I I I I II I I I I I II I I I I I I II I I I I I I M I I I I 
al4 0 WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 40. pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 
I I I I I I I I I I I I M I II M I I I I I I M I I I t I I I I I I I I I I II I I I I I : I I I I I I I II I I 
al40 MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSITEGTLVGLAGL 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml4 0 . pep KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I II I I M I I I I I I I 11 I I II I II I II I 
al40 KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 



wo 99/57280 



PCT/US99/09346 



398 



370 380 390 400 410 420 

430 440 450 

ml 4 0 . pep VEFGNGWNGLARYS YAGSKQYGNHSGRVGVGYRFX 

I I I t M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al 4 0 VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX 

430 440 450 



The following partial DNA sequence was identified in Kgonorrhoeae <SEQ ID 553>: 

gl41.seq 

1 atgagcttca aAAccgATGC CGAAACCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAC AACATTGAGC 

101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAGCTG 

151 CCGCAAAAAC AAGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 

201 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCATTGC 

251 GCCATATCGG CAAAGACTCT GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 

301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ACGCGCAAGT 

351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGCGAC TTCCACGCCA 

401 TCGGTGCGGC GAATAACCTC CTCGCCGCCA TGCTCGACAA CCATATCTAC 

451 CAAGGTAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGTATGGGCA 

551 AGCCTGTtga cggCGTGATG CGtcccGACG GCTTCGACAT CACCGTCGCC 

601 TCCGAAGTGa tggcgGTATT CTGCCTTGCC AAAGACATCA GCGATTTGAA 

651 AGAGCGTTtt gGCAATATTC TCGTCGCCTA CGCO^AAGAC GGCAGCCCCG 

701 TTTACGCCAA AGATTTGAAG GCACACGGCG CGATGGCGGC ATTGCTAAAA 

751 GATGCGATTA AGCCCAATTT GGTGCAAACC ATCGAAGGCA CTCCGGCCTT 

801 TGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTTA 

851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 

901 GGCTTCGGCG CGGACTTGGG TGCGGAAAAA TTCTGCGACA TCAAATGCCG 

951 CCTTGCCGGT TTGAAACCTG ATGCGGCAGT CGTCGTGGCG ACTGTCCGCG 

1001 CCCTGAAATA CAACGGCXXSC GTGGAACGCG CCAACCTTGG TGAAGAAAAC 

1051 CTCGAAGCCT TGGCAAAAGG TTTGCCCAAC CTGTTGAAAC ACATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACTC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCT^G GCGGCGCGGG 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA TGCCATCGAC AACCAACCTA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTCG ATTTCAGCGC 

1401 GGAAGCGTCT GCCGAAATCG CCTCGCTGGA AAAACTGGGC TTGGACAAAA 

1451 TGCCGATCTG CATGGCGAAA ACCCAATATT CATTGAGCGA CAACGCCAAA 

1501 CTCTTGGGCT GCCCCGAAGG CTTCCGCATC GCCGTAC6CG GTATCACTGT 

1551 TTCCGCCGGC GCGGGCTTCA TCGTTGCGTT GTGCGGCAAT ATGATGAAAA 

1601 TGCCGGGCCT GCCGAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGAA 

1651 CACGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 554; ORF 141.ng>: 

gl41 .pep 

1 MSFKTDAETA QSSTMRPIGE lAAKLGLNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDS VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPD GFDITVA 

201 SEVMAVFCLA KDISDLKERF GNILVAYAKD GSPVYAKDLK AHGAMAALLK 

251 DAIKPNLVQT lEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

301 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

351 LEALlAKGLPN LLKHISNLKN VFGLPVWAL NRFVSDSDAE LAMIEKACAE 

401 HGVEVSLTEV WGKGGAGGAD LARKWNAID NQPNNFGFAY DVELGIKDKI 

451 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEGF RI AVRGITVSAG AGFIVA LCGN MMKMPGLPKV paaekidvde 

551 HGVIHGIiF* 

The following partial DNA sequence was identified in Kmeningitidis <SEQ ID 555>: 
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ml41. seq 

1 ATGAGCTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTTGGTCT GAATGCCGAC AACATTGAGC 

101 CTTACGGTCA TTACAAGGCG AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 

201 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCGTTGC 

251 GCCACATCGG CAAAGATGCC GTGATTGCCC TGCGCGAACC TTCTCTGGGG 

301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 

351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 

401 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 

451 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGTA 

551 AACCCGTTGA CGGCX5TGATG CGTCCTGACG GTTTCGATAT TACCGTTGCT 

601 TCCGAAGTGA TGGC6GTATT CTGTCTTGCC AAAGACATCA GCGATTTGAA 

651 AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 

701 TTTACGCCAA AGATTTGAAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 

751 GATGCX3ATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 

801 CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 

851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 

901 GGCTTCGGCG CGGACTTGGG CGCGGAAAAA TTCTGCGACA TCAAATGCCG 

951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 

1001 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 

1051 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACGC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 

1401 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA 

1451 TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 

1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 

1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 

1601 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 

1651 GAAGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 556; ORF 141>: 

ml41 .pep 

1 MSFKTDAEIA QSSTMRPIGE lAAKLGLNAD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDA VIALREPSLG 

101 PVFGVKGGAA GGG YAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPD GFDITVA 

201 SEVMAVFCLA KDISDLKERL GNILVAYAKD GSPVYAKDLK ANGAMAALLK 

251 DAIKPNLVQT lEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

301 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

351 LDALEKGLPN LLKHISNLKN VFGLPVWAL NRFVSDADAE LAMIEKACAE 

401 HGVEVSLTEV WGKGGAGGAD LARIOA/NAIE SQTNNFGFAY DVELGIKDKI 

451 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEDFR I AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDA 

551 EGVIHGLF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K^onorrhoeae 

ORF 141 shows 97.5% identity over a 558 aa overlap with a predicted ORF (ORF 141. ng) 
from N. gonorrhoeae: 

ml41/gl41 



10 20 30 40 50 60 

ml41.pep MSFKTDAEIAQSSTMRPIGEIAAKLGIiNADNIEPYGHYKAKINPAEAFKLPQKQGRLrLV 

IIIIMII IMMIIIIIIIIIIIIIhllllllllllllMMIIIIIIIIIIIIMI 

9141 MSFKTDAETAQSSTMRPIGEIAAKLGLNVDNIEPYGHYKAKINPAEAFKLPQKQGRLILV 
10 20 30 40 50 60 
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70 80 90 100 110 120 

ml 4 1 . pep TAI NPTPAGEGKTTVTI GLADALRHI GKDAVI ALRE PSLGP VFGVKGGAAGGGYAQVLPM 

llllllllllllllllllllllllllllhlllllllllllllliiiiiiiiiiiiiiii 

gl41 TAINPTPAGEGKTTVTIGLADALRHIGKDSVIALREPSLGPVFGVKGGAAGGGYAQVLPM 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml41 .pep EDINLHFTGDFHAIGAANNLIJy^MLDNHiyQGNELNIDPKRVLWRRVVDf^ 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

gl41 EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRWDMNDRQLRNIID 
130 140 150 160 170 180 



190 200 210 220 230 240 

ml41 . pep GMGKPVDGVMRPDGFDITVASEVMAVFCIiAKDISDLKERLGNILVAYAKDGSPVYAKDLK 

llllllllllllllllllllllllllllllllllllllhllllllllllllllllllll 

g 1 4 1 GMGKPVDGVMRPDGFDI TVASEVMAVFCLAKDI SDLKERF6NI LVAYAKDGS PVYAKDLK 

190 200 210 220 230 240 



250 260 270 280 290 300 

ral41 . pep ANGAMAALLKDAIKPmVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 

hllllllllllllllllllllllllllllllllllllllllllllllllMIIIIIIII 

gl4 1 AHGATIAALLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml41 .pep GFGADLGAEKFCDIKCRLAGLKPDAAVWATVRALKYNGGVERANIiGEENLDALEKGLPN 
lllllllllllllllllllllll||l|||||||||||||||||||||||||:|| Mill 
gl4 1 GPGADLGAEKFCDIKCRIiAGLKPDAAVWATVRALKYNGGVERANLGEENLEAliAKGLPN 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 4 1 . pep LLKHI SNIjKNVFGLPWVALNRFVSDADAEIiAMI EKACAEHGVEVS LTEVWGKGGAGGAD 

lllllllllllllllllllllllllhllllllllllllllllllllllliiilllllll 

g 1 4 1 LLKHI SNLKNVFGLPVWALNRFVS DSDAELAMI EKACAEHGVEVS LTEVWGKGGAGGAD 

370 380 390 400 410 420 



430 440 450 460 470 480 

ml41 . pep LARKWNAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 

III III 1 1 h:i II II I Mill nil mil mil nil Mill I II II I III! I III 

g 1 4 1 LARKWNAIDNQPNNFGFAYDVELGI KDKI RAI AQKVYGAEDVDFS AEASAE I ASLEKLG 

430 440 450 460 470 480 



490 500 510 520 530 540 

ml41 . pep LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGI WSAGAGFIVALCGNMMKMPGLPKV 

IIMIMIIMIIIIIIIIIIIIIII Illlllllllllllllllllllllllll 

gl41 LDKMPICMAKTQYSLSDNAKLLGCPEGFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 
490 500 510 520 530 540 



550 559 
ml41 . pep PAAEKIDVDAEGVIHGLFX 

lllllllll 
gl41 PAAEKIDVDEHGVIHGLFX 

550 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 557>: 

al41 , seq 

1 ATGAGTTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAC AACATTGAGC 

101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 
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201 GGCGGGCGAA GGTAAAACCA 

251 GCCATATCGG CAAAGACTCT 

301 CCGGTGTTCG GCGTGAAAGG 

351 TTTGCCGATG GAAGACATCA 

4 01 TCGGTGCGGC AAATAATCTG 

451 CAAGGCAACG AGTTGAACAT 

501 GGTCGATATG AACGACCGCC 

551 AGCCTGTTGA CGGCGTGATG 

601 TCCGAAGTGA TGGCGGTATT 

651 AGAGCGTTTG GGCAACATCC 

701 TTTACGCCAA AGATTTGAAA 

751 GATGCGATTA AGCCCAACTT 

801 CGTACACGGC GGCCCGTTCG 

851 CCGCAACCCG TCTGGCGAAA 

901 GGCTTCGGCG CGGACTTGGG 

951 CCTTGCCGGT TTGAAACCTG 

1001 CGTTGAAATA TAACGGCGGC 

1051 TTAGACGCTT TGGAAA/yVGG 

1101 CCTGAAAAAC GTATTCGGAC 

1151 TGTCCGACTC CGATGCCGAG 

1201 CACGGCGTTG AAGTTTCCCT 

1251 CGGCGCGGAT TTGGCGCGCA 

1301 ATAACTTCGG TTTCGCCTAC 

1351 CGTGCGATTG CCCAAAAAGT 

1401 GGAAGCGTCT GCCGAAATCG 

14 51 TGCCGATCTG CATGGCGAAA 

1501 CTGTTGGGCT GCCCCGAAGA 

1551 TTCCGCAGGC GCAGGTTTCA 

1601 TGCCCGGCCT GCCCAAAGTT 

1651 GAAGGCGTGA TTCACGGCTT 



CCGTAACCAT CGGTTTGGCG GACGCATTGC 
GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 
CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 
ACCTGCACTT CACCGGAGAT TTTCACGCCA 
CTTGCCGCGA TGCTCGACAA CCATATCTAC 
CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 
AGTTGCGCAA CATCATCGAC GGCATGGGCA 
CGTCCTGACG GTTTCGATAT TACCGTTGCT 
CTGTCTTGCC AAAGACATCA GCGATTTGAA 
TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 
GCGAATGGCG CGATGGCGGC ATTGCTTAAA 
GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 
CCAACATCGC CCACGGCTGC AACTCCGTAA 
CACCTTGCCG ATTACGCCGT AACCGAAGCA 
CGCGGAAAAA ■ TTCTGCGACA TCAAATGCCG 
ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 
GTGGAACGCG CCAACCTCGG CGAAGAAAAT. 
TTTGCCCAAC CTGCTGAAAC ACATTTCCAA' 
TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 
TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 
GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 
AAGTCGTCAA CGCCATTGAA AGTCAAACCA 
GATGTCGAGT TGGGCATCAA AGACAAAATC 
GTACGGCGCG GAAGATGTTG ATTTCAGCGC 
CTTCACTGGA AAAACTGGGC TTGGACAAAA 
ACCCAATACT CTTTGAGCGA CAACGCCAAA 
CTTCCGCATC GCCGTGCGCG GCATCACCGT 
TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 
CCGGCTGCCG AGAAAATCGA TGTGGACGCA 
GTTCTGA 



This corresponds to the amino acid sequence <SEQ ID 558; ORF 141. a>.: 

al41 .pep 

1 MSFKTDAEIA QSSTMRPIGE lAAKLGLNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDS VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPD GFDITVA 

201 SEVMAVFCLA KDISDLKERL GNILVAYAKD GSPVYAKDLK ANGAMAALLK 

251 DAIKPNLVQT lEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEJ\ 

301 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

351 LDALEKGLPN LLKHISNLKN VFGLPVWAL NRFVSDSDAE LAMIEKACAE 

401 HGVEVSLTEV WGKGGAGGAD LARKWNAIE SQTNNFGFAY DVELGIKDKI 

451 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEDFR I AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDA 

551 EGVIHGLF* 

ml41/al41 99.5% identity in 558 aa overlap 

10 20 30 40 50 60 

ml41.pep MSFKTDAEIAQSSTMRPIGEIAAKLGLNADNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

I i I M I I M I I I I I I I I I I I i I : I I I I I I I I I I I I I I I I I I ! II 

al41 MSFKTDAEIAQSSTMRPIGEIAAKLGLNVDNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml41.pep TAINPTPAGEGKTTVTIGLADALRHIGKDAVIALREPSLGPVFGVKGGAAGGGYAQVLPM 
I I I I I I ) I I I t I I I I I I I I I I I I I I I I I I : I j I I I I I I I I I I I I I t I I I I I I I I I I I I I I 
al41 TAINPTPAGEGKTTVTIGLADALRHIGKDSVIALREPSLGPVFGVKGGAAGGGYAQVLPM 

70 80 90 100 110 120 

130 140 150 160 170 180 

inl4 1 . pep EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRWDMNDRQLRNIID 
- I 1 I I I I I I I M I I I t I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al41 EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRWDMNDRQLRNIID 

130 140 150 160 170 180 
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190 200 210 220 230 240 

ml 4 1 . pep GMGKPVDGVMRPDGFDITVASEVMAVFCLAKDISDLKERLGNILVAYAKDGSPVYAKDLK 
I I I I I t I I I I I I I I I I I I I I I I M I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al 4 1 GMGKPVDGVMRPDGFDITVASEVMAVFCLAKDISDLKERLGNILVAYAKDGSPVYAKDLK 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml4 1 . pep ANGAMAALLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 
I I I I I I I I I I I I M I I I I I I I I I I I I I I ) I I I I I I I I I I I I I I I t I I I ) I I I I I I I I I I I 
al41 ANGAMAALLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml 4 1 . pep GFGADLGAEKFCDIKCRL^GLKPDAAVWATVRALKYNGGVERANLGEENLDALEKGLPN 
I I t I I I I I I I I I I I I I I I I I I I I I i i I I I I I I I I I i I I I I I I I I I I I I I I I I I t I I I I I I 
al41 GFGADLGAEKFCDIKCRLAGLKPDAAVWATVRALKYNGGVERANLGEENLDALSKGLPN 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 4 1 . pep LLKHISNLKNVFGLPVWALNRFVSDADAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 
t I I I ! I I I I I I I i I I I I I I I I I I I I I : t I I I I I I I I I I I I I I I I I f M I M I I I I I I I I I 
al41 LLKHISNLKNVFGLPVWALNRFVSDSDAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 

370 380 390 400 410 420 



430 440 450 460 470 480 

ml41,pep LARKWNAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 
M I I I I M I I I I I I I I t I I I I I I I i I j I I I I I I ] I I I I I 1 I I I I I I I M I I I I I I I I I I I 
al41 LARKWNAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml41,pep LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 
I I I t I M I I I I I I M I 1 I I I I I I I I I I I I I M I I I I I I I I I I 1 I I I M I I I I I I I I I I I I 
al41 LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 

490 500 510 520 530 540 



550 559 
ml41 .pep PAAEKIDVDAEGVIHGLFX 
I I I I I I t I I I I I 1 I I I I I I 
al 4 1 PAAEKIDVDAEGVIHGLFX 

550 



The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 559>: 

gl42 . seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCTTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAAATATGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCAT ATTGATGCAG AGgCTGCCGT 

201 TTTCCGACAG GATcggaATG AttcgCGCAC TCCGGTTTAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCGGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

401 GCCATTTTTC CCCTTTAAAC CGTCCCCTAT ATAAGAATGC TGCACACAAG 

451 GCATCCCCCC ATGTGCAGCA GTTCTGA 

This coiresponds to the amino acid sequence <SEQ ID 560; ORF 142.ng>: 

gl42 .pep 

1 MRADFMFADN MPVQVRQRAF YFKLSRFAAM PNMVGKPLFG RQAGQPGKMF 

51 GNILMFVRQH IDAET^VFRQ DRNDSRTPVY AQHHGRRLVG NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN RPLYKNAAHK 

151 ASPHVQOF* 
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The following partial DNA sequence was identified in Kmeningitidis <SEQ K) 561>: 
ml42 . aeq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAGATGTGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTTCCGACAG GATCX3GAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCX3 GCTCGTCGGT AACCGGCGCG ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACX5 CCTGCCGCGC 

351 AAGATGCCAT CGCATCACGG AACGAAGTTT GAAAATTTTT CTGCAAATCC 

401 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG 

451 GCATCCCCCC ATGTGCAGCA GTTTTGA 

This corresponds to the amino acid sequence <SEQ DD 562; ORF 142>: 

ml42.pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDWGKPLFG RQAGQPGKMF 
51 GNILMFVRQR IDAEAAVFRQ DRNDSRTPVD AQHHGRRLVG NRRDRRHCNA 
101 VTPCRTVCRD DMNACRARCH RITERSLKIF LQIRHFSPLN CPLYKNAAHK 
151 ASPHVQQF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K^onorrhoeae 

ORF 142 shows 93.7% identity over a 158 aa overlap with a predicted ORF (ORF 142.ng) 
from N, gonorrhoeae: 

ml42/gl42 

10 20 30 40 50 60 

MRADFMFADNMPVQVRQRALYFKLSRFAAMPDWGKPLFGRQAGQPGKMFGNILMFVRQR 

illMIIIIIIIIIIIII|:|||||||M||::||||||||||||||||||||||MI|: . 
MRADFMFADNMPVQVRQRAFYFKLSRFAAMPNMVGKPL FGRQAGQPGKMFGNI LMFVRQH 
10 20 30 40 50 60 



ml42 .pep 
gl42 



70 80 90 100 110 120 

ral4 2 . pep IDAEAAVFRQDRNDSRTPVDAQHHGRRLVGNRRDRRHCNAVTPCRTVCRDDMNACRARCH 

IIIIIIMIIIIIIIIIII llllllllllllhllllllllllllllllllllll: II 
gl42 IDAEAAVFRQDRNDSRTPVYAQHHGRRLVGNRRNRRHCNAVTPCRTVCRDDMNACRTGCH 
70 80 90 100 110 120 



130 140 150 159 

ml42 . pep RITERSLKIFLQIRHFSPLNCPLYKNAAHKASPHVQQFX 

Illlllll lllllllllll IIIIIMIIIIIIIIIII 

gl42 RITERSLKSFLQIRHFSPIiNRPLYKNAAHKASPHVQQFX 
130 140 150 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 563>: 

al42 . seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAGATGTGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTTCCGACAG GATCGGAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCGGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

401 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG 

451 GCACCCCCCA TGTGCAGCAG TTCTGATTCA AAAAGCCGTC GGTCGGACAT 

501 TTCCGCGCGT TACGGCGTAT TACGAGTTCA ACGCATCCTC GATTTTGGCA 

551 AGTTCTGCCA ACAGGTCTTT AAGCAGCAGC ATTTTCTCGC GGCCCAGCAC 

601 TTCCTCGATA GCGTCGTAAC GCTCGTCCAC TTCTTCGCCG ATTTCCTCAT 

651 ACAGCTTCTC GCCCTCGGCA GTCAGCTTCA GAAAAACACG TCGTTGGTCG 

701 TTGGAAGGTT TCAGGCGGAC AACCAAACCC GCTTTTTCAA GGCGGGTCAG 

751 GATACCGGTC AGGCTGGGGC GCAAAATGCA CGCCTGATTC GCCAAATCTT 
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801 GAAAGTCCAG CGTGCCGTTT TCCGCCAAAA GACGGATAAT CCGCCATTGC 
851 TGATCGGTAA TATTCGCCTG ATTCAGAATA GGCCTGAATT GGGTCATCAG 
901 GGCTTCCCTT GCCTGTATCA GACCGATATT GATAGACGCA TGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 564; ORF 142.a>: 

al42 ,pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDWGKPLFG RQAGQPGKMF 

51 GNILMFVRQR IDAEAAVFRQ DRNDSRTPVD AQHHGRRLVR NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN CPLYKNAAHK 

151 APPMCSSSDS KSRRSDISAR YGVLRVQRIL DFGKFCQQVF KQQHFLAAQH 

201 FLDS WTLVH FFADFLIQLL A LGSOLOKNT SLWGRFQAD NQTRFFKAGQ 

251 DTGQAGAQNA RLIRQILKVQ RAVFRQKTDN PPLLIGNIRL IQNRPELGHQ 

301 GFPCLYQTDI DRRMF* 

ml42/al42 96. 1% identity in 153 aa overlap 

10 20 30 40 50 60 

ml 4 2 . pep MRADFMFADNMPVQVRQRALYFKLSRFAAMPDWGKPLFGRQAGQPGKMFGNILMFVRQR 
l)))lltlllllll||MIIIIIIII||||||)||||tll|||||tlli||||||||||i 
MRADFMFADNMPVQVRQRALYFKLSRFAAMPDWGKPLFGRQAGQPGKMFGNILMFVRQR 
10 20 30 40 50 60 



al42 



70 80 90 100 110 120 

ml42 . pep IDAEAAVFRQDRNDSRTPVDAQHHGRRLVGNRRDRRHCNAVTPCRTVCRDDMNACRARCH 
1 I I I I I I I I I I I I I I I I I i I I I I I I I I I I llhlllllllillllMIIIIIIII: II 
IDAEAAVFRQDRNDSRTPVDAQHHGRRLVRNRRNRRHCNAVTPCRTVCRDDMNACRTGCH 
70 80 90 100 110 120 



al42 



130 140 150 159 

ml42 .pep RITERSLKIFLQIRHFSPLNCPLYKNAAHPCASPHVQQFX 
I I I I I I I I I I I I I f I I I I I I I I I I I I I I I I I 

RITERSLKSFLQIRHFSPLNCPLYKNAAHKAPPMCSSSDSKSRRSDISARYGVLRVQRIL 
130 140 150 160 170 180 



al42 



al42 



DFGKFCQQVFKQQHFLAAQHFLDSWTLVHFFADFLIQLLALGSQLQKNTSLWGRFQAD 
190 200 210 220 230 240 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 565>: 

gl43 . seq 

1 ATGTTGAGCT TCGGCTATCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAGATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTTCAGCCG 

151 ATAGTGgGCT ACTACTGAGA CCGCACTTGG AAGCCGCGCT TGGGCGGCCG 

201 CCGCCTGCCG TATCTGCTTT ACGGCACGCT GATTGCGGTC ATCGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCCTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTGGACG TGTCGTCGAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGATATG GTCAACGAGG 

401 AGCAGAAAAG CTACGCCTAC GGGATTCAAA GTTTCTTAGC GAATACGGAC 

451 GCGGTTGTGG CAGCGATTCT GCCGTTTGTG TTcgcgtata TCGGTTTGGC 

501 GAACACTGCC GAGAAAGGCG TTGTGCCACA AACCGTGGTC GTAGCATTCT 

551 ATGTGGGTGC GGCGTTACTG ATTATTACCA GTGCGTTCAC AATCTCCAAA 

601 GTCAAAGAAT ACGACCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGTT CGAACTCTTA AAAACCGCGC 

701 CTAAAGTGTT TTGGACGGTT ACTCCGGTAC AGTTTTTCTG CTGGTTCGCC 

751 TTCCGGTATA TGTGGACTTA CTCGGCAGGC GCGATTGCAG AAAACGTCTG 

801 GCACACTACC GATGCGTCTT CCGTAGGCCA TCAGGAGGCG GGCAACCGGT 

851 ACGGCGTTTT GGCGGCGGTG TAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 ATTCTGGCAA AAGTACCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GTTTCTTCTC TATCTTCTTC ATCTACAATC 

1001 AATACGCACT CATCCTGTCT TATATCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGGCC AACGCTTTGT CGGGCAAACA 

1101 CATGGATACT TATTTGGGCC TGTttaacgg ctctgtCTGT ATGCcgcaaa 

1151 tcgTcgctTC GctgttgAGT TTCGTGCTTT TCCCGATGCT GGGCGGCCAT 
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1201 CAGGCAACCA TGTTCTTGGT TGCAGGCGCA GTCTTGCTGC TGGGAGCCTT 

1251 CTCAGTCTGT CTGATTAAAG AGATCCACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 566; ORF 143.ng>: 
gl43 .pep 

1 MLSEX3VLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVOP 

51 IVGYYSDRTW KPRLGGR RLP YLLYGTLIAV IVMILM PNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKSYAY GIQSFIANTD 

151 AWAAILPFV FAYIGLA NTA EKGWPQT W VAFYVGAALL IIT5A FTISK 

201 VKEYDPETYA RYHGIDVAAN QEKANWFELL KTAPKVFWTV TPVQFFCWFA 

251 FRYMWTYSAG AIAENVWHTT DASSVGHQEA GNRYG VLAAV *SVAAVICSF 

301 ILAKVPNKYH KAG YFGCLAL GALGFFSIFF lYN QYALILS YILIGIAWAG 

351 IITYPLTIVA NALSGKHMDT YLGLFN GSVC MPQIVASLLS FVL FPMLGGH 

401 QATMFLVAGA VLLLGAFSVC LIKEIHGGV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 567>: 

ml43 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAAATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

4 01 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC AAATACGGGC 

451 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACGATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

701 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 

751 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG 

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1001 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA 

1101 TATGGGCACT TACTTGGGCT TGTTTAACGG CTCTATCTGT ATGCCTCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 568; ORF 143>: 

ml 4 3 . pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVGHYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLAN TA EKGWPQT W VAFYVGAALL VITSA FTIFK 

201 VKEYDPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKVPNKYH KAG YFGCLAL GALGFFSVFF IGNQ YALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFN GSIC MPQIVASLLS FVL FPMLGGL 

401 QATM FLVGGV VLLLGAFSVF LIKETHGGV* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. ^gonorrhoeae 



ml43 / gl43 93.9% identity in 429 aa overlap 
10 20 30 



40 



50 



60 
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ml 4 3 . pep MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 
I I t I I : I I I I I I I I I t I I I I I I I I 1 I ) I I I I I I I I t 1 I I I i t I I I I I t t I I I I : I I I I I I 
gl43 ' MLSFGYLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGYYSDRTW 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml43 . pep KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 
I 1 I I I I I I I I t I I I I I I I I I 1 I I I I I I i I I I I I I I I I I I ) I I I i I i I I I I I It I I I i 1 I I 
gl43 KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 3. pep QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAWAAILPFVFAYIGLANTAEKGWPQTW 
I i I I I I I I I I i I I I I I : I I M I I I I I I I I I I i i i I I I I I I I I I I t I I I I I 1 I I I I I I I I 
gl43 QPFKMMVGDMVNEEQKSYAYGIQSFLANTDAWAAILPFVFAYIGLANTAEKGWPQTW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml43 . pep VAFYVGAALLVITSAFTIFKVKEYDPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 
I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I i I I I I I i i I I I I I 1 I : t I I I I I I I : I I I I 
gl43 VAFYVGAALLIITSAFTISKVKEYDPETYARYHGIDVAANQEKANWFELLKTAPKVFWTV 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml43 . pep TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 
I I I I I I I I I I : I I I I I I I I I I I I I I I I I t I I I I I I : I I I I I I I I I I I I I I I I I I I I I 
gl43 TPVQFFCWFAFRYMWTYSAGAIAENVWHTTDASSVGHQEAGNRYGVLAAVXSVAAVrCSF 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml43.pep VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 
: I I I I I I 1 t I I I I I I I I I I I I ) I I I I I : I I I 11111:111 I I I I I I I I I i I I I I I I I : 
gl43 ILAKVPNKYHKAGYFGCLALGALGFFSIFFIYNQYALILSYILIGIAWAGIITYPLTIVA 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml43 . pep NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGWLLLGAFSVF 

I I I I I I 1 I I I I I I I I I I : I ) 1 I I I I I I i I I I I I I I I I I I 1 I I I I I : I : I I I I I I I I I 
gl43 NALSGKHMDTYLGLFNGSVCMPQIVASLLSFVLFPMLGGHQATMFLVAGAVLLLGAFSVC 

370 380 390 400 410 420 

430 

ml 4 3 . pep LIKETHGGVX 
I I I I I I M I 
gl43 LIKEIHGGVX 
430 

The following partial DNA sequence was identified in K meningitidis <SEQ ED 569>: 

al43 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAGATG AGCCGCATCT TCCAGACGCT CGGTGCCGAT CCGCACAGCC 

101 TCGGCTGGTT CTTTATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

401 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC GAATACGGGC 

451 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACAATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

701 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 
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GCGATTGCGG AAAACGTCTG 
TCAGGAGGCG GGTAACTGGT 
CGGCGGTGAT TTGTTCGTTT 
AAGGCGGGTT ATTTCGGCTG 
CGTTTTCTTC ATCGGCAACC 
TCGGCATCGC TTGGGCGGGC 
AACGCCTTGT CGGGCAAGCA 
CTCTATCTGT ATGCCGCAAA 
TCCCTATGCT GGGCGGCTTG 
GTCCTGCTGC TGGGCGCGTT 
CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 570; ORF 143.a>: 

al43 .pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHSLGW FFIL PPLAGMLVQP 

51 IVGHYSDRTW KPRLGGR RLP YLLYGTLIAV IVMILM PNSG SFGFGYASLA 

101 AL5FGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLAN TA EKGWPQT W VAFYVGAALL VITSA FTIFK 

201 VKEYNPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKVPNKYH KAG YFGCLAL GALGFFSVFF I GNQY ALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFNGSIC MPQ IVASLLS FVLFPMLGGL 

401 QATM FLVGGV VLLLGAFSVF LI KETHGGV * 



751 TTCCAATATA TGTGGACTTA CTCGGCAGGC 

801 GCACACCACC GATGCGTCTT CCGTAGGTTA 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC 

1001 T^ATACGCGCT GGTGTTGTCT TATACCTTAA 

1051 ATTATCACTT ATCCGCTGAC GATTGTGACC 

1101 TATGGGCACT TACTTGGGCC TGTTTAACGG 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG 



ml43/al43 99.5% identity in 429 aa overlap 



10 20 30 40 50 60 

inl43 . pep ML SFGFLGVQTAFTLQSSQMSR I FQTLGADPHNLGWFFI LP PLAGMLVQP IVGHYSDRTW 

I I I I I I I I I I It I I t I I I I I I I I I I I I I I I I I : I I I I M I I I I I I I I i I I I I I I I I I I I I 
al43 MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHSLGWFFILPPLAGMLVQPIVGHYSDRTW. 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml43.pep KPRLGGRRLP YLLYGT LI AVI VM I LMPNSGSFGFGYASLAALSFGALM I ALLDVSSNMAM 

I I I I I M I I I I 1 I I M I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I 1 I I I I I I I I I I 
al43 KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 
70 80 90 100 110 120 



130 140 . 150 160 170 180 

ml43.pep QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAWAAILPFVFAYIGLANTAEKGWPQTW 
I M I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I i I I I I I I I I I 
al43 QPFKMMVGDMVNEEQKGYAYGIQSFLAKTGAWAAILPFVFAYIGLANTAEKGWPQTW 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 4 3. pep VAFYVGAALLVITSAFTIFKVKEYDPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 
I I I I I I I I I I I I I I I I I I I I I t I I : I I I I I I I I I I I I 1 I I I I I I I i I I I I I I I 1 i I I I I I 
al43 VAFYVGAALLVITSAFTIFKVKEYNPETYARYHGIDVAANQEKAKfWIELLKTAPKAFWTV 

190 200 210 220 230 240 



250 260 270 280 290 300 

na43 .pep TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 
1 I I i I I I I 1 I I I I i I I I I I I I I I I I 1 I I M I i I I I I I I I I I I M I I I t I I t t I I I I I I I I 
al43 TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml43 . pep VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 
I I I I I I I I I I I I 1 I I I I I i I M I i I I I I I I I i i I i I ) I I I I I 1 1 I 1 I I I I t I I I I I I i I ] 
al43 VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 

310 320 330 340 350 360 



ral43 .pep 



370 380 390 400 410 420 

NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGWLLLGAFSVF 
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al43 



I I i I I I t I I I I I i I I I I I I I I t I I I I I t I t I I I I I I I I I I I I I I 

NALSGKHMGTYLGLFNGSICMPQIVASLLSEVLFPMLGGLQATMEXVGGVVLLLGAFSVF 
370 380 390 400 410 420 



ml43.pep 



430 

LIKETHGGVX 
I I I I I I i I I I 



al43 



LIKETHGGVX 
430 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 57 1>: 



1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGGGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGTGC GTCTTCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC CCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGTAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCcgtTT 

351 CAACGCGGTG GCGGCAGACG GccgacggTt atCCCAACGA TTTGGatatT 

401 TCctaccgCT TGGACGAGGA CGGCCGGCTT ACCGTtaccT ATCGCGCCAC 

451 CGCgctCGGC GACACGGTGT TCGACCCGAC GCTGCACATT TACTGGCGGC 

501 TGGACGCGGG CCTGCACGAT GCGGTTCTGC ATATTCCGCA GGGCGGACAT 

551 ATTCCGGCCG ATGCCGAAAA ACTGCCCGTC TTAACGGTTT CAGACGGCCT 

601 CGAAGTATTT GA 



This corresponds to the amino acid sequence <SEQ ID 572; ORF 144.ng>: 



1 MSDTPATRDF GLIDGRAVTG YVLSNRRGT C VFVLDLGGIV QEFSVL ADGV 

51 RENPWSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRRLSQR FGYFLPLGRG RPAYRYLSRH 

151 RARRHGVRPD AAHLLAAGRG PARCGSAYSA GRTYSGRCRK TARLNGFRRP 

201 RSI* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 573>: 



1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGTCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGATGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAAC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTGg 

401 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

451 CCGCTTGGAC GAGGACGACC GGCTTACCGT TAcCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATGCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 



This corresponds to the amino acid sequence <SEQ ID 574; ORF 144>: 

ml44 .pep 



1 MSDTPATRDF GLIDGRAVTG YVLSNRRGT R VCVLDLGGIV QEFSVL ADGV 

51 RENLWSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLATVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYA 

201 GRCRKTARLN GFRRPRSI* 



gl44 .seq 



gl44 .pep 



ml4 4 . seq 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N, gonorrhoeae 

ml44 / gl44 91.3% identity in 218 aa overlap 

10 20 30 40 ' 50 60 

ml4 4 . pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGVRENLWSFDD 
I I I I I I 1 I I I I I t I I I I I I I I I I ! I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 
g 1 4 4 MS DT PATRDFGL I DGRAVTG YVLSNRRGTCVFVLDLGG I VQEFS VLADGVRENP WS FDD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 4 . pep AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 

1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 M 1 1 n I I t I M I I I I I I I I I I I I 

gl44 AASYADNPFQINKQIGRVAGRIRGTVAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 4 . pep AADGRS WLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

III II II II III I II II I I II II I II II II I II II II II II II II 

gl44 AAD GRRLSQRFG— YFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 



190 200 210 219 

ml 4 4 . pep AAGRG PARCGS AYS AGRT YAGRCRKT ARLNGFRRPRS IX 

I II I I I I I I II I I I I I I I I : I I I I I I I I I I I I I I I II M 
gl44 AAGRGPARCGSAYSAGRTYSGRCRKTARLNGFRRPRSIX 
170 180 190 200 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 575>: 

al44 . seq 

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTG. 

401 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

451 CCGCTTGGAC GAGGACGACC GGCTTACCGT TACCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATTCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 



This corresponds to the amino acid sequence <SEQ ED 576; ORF 144.a>: 

al44 .pep 

1 MSDTPATRDF GLIDGRAVTG YVLSNRRGTR VCVLDLGGIV QEFSVLA DGV 

51 RENLVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLXTVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYS 

201 GRCRKTARLN GFRRPRSI* 

ml44/al44 99.1% identity in 218 aa overlap 



10 20 30 40 50 60 

ml44 , pep M S DT P ATRD FG LI DGRAVTG YVLSNRRGTRVCVLDLGG I VQEFSVLADGVRENLWS FDD 

I I I I I I M I II I I I I I II II I I M I I I I I I I I I I I II t I I I I I I I II I I I I I I I I I I I I I 
a 1 4 4 MS DT PATRDFGL I DGRAVTG YVLSNRRGTRVCVLDLGG I VQE FS VLADGVRENLWS FDD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 4 . pep AAS YADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
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MlllillllllllllllllllllllllMIIIIIIIIIIIIIMIIIIItillillll) 

al44 AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 4 . pep AADGRSWLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 
I I I I I 1 f I I I I I I lllllllllllMllltlllllillllllllMlllilllllllll 
al 4 4 AADGRSWLRSRLXTVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 170 180 



190 200 210 219 

AAGRGPARCGSAYSAGRTYAGRCRKTARLNGFRRPRSIX 
IIIIIIIIMIIIIIIIIhlllllllllllllllllll 
AAGRGPARCGSAYSAGRTYSGRCRKTARLNGFRRPRSIX 
190 200 210 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 577>: 

gl46. seq 

1 ATGAAGCAAA TCCCCCTCCG CCTTCTCCAG GTCGTCATTG ACCACGACAA 

51 AGTCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAa ctTCCCGACT GTCCGTCCCG CGCcctTTGA GGCGCGCGGC 

151 AAGCACGTCG AAAGAAGGCG GCAGGATAAA GATACCGACA GCTTCCGGCA 

201 GCGCGTTGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACG CCCTCCGTGC TTGTGCCGTA 

301 ATAGTTGCCG AATACGTCTG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGATTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

4 01 TCGCCTTCAC GCGGCGGGCG CGTCGTATGC GACACGGAAA CGCGCAAACC 

4 51 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 TVAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAA CCGATTGTAT CACAACGGAC ACCCTATTTC 

601 ATATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 578; ORF 146.ng>: 

gl4 6.pep 

1 MKQIPLRLLQ WIDHDKVEQ YGLFDFMPCL RQPPLDNFPT VRPAPFEARG 

51 KHVERRRQDK DTDSFRQRVA NLRRALNVDF QNHVIACRRQ R IHALRACAV 

101 IVAEYVCVFQ KSLLRDKRFK LFFGNKVIMY AVCFAFTRRA RRMRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPYF 

201 IFADAHILPL LF* 

The following partial DNA sequence was identified in M meningitidis <SEQ ID 579>: 

ml46. seq 

1 ATGGCGCAAA TCCTCCTCCG CTCGCGCCAA GTCGTCATTG ACCACGACAA 

51 AGTCAAACAA TACGGACTGC TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GGCGCGCGGC 

151 AAGTACGTCG AAAGAAGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCGTCGCG AACCTGCGCC GCGCCCTG7VA CGTCGATTTC CAAAATCACG 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC CTGTGCCGTA 

301 ATAGTTGCCA AATACGTCGG CGTATTCCAA AAAAGCTTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGGCG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAATGGAC ACCCAGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 580; ORF 146>: 

ml4 6.pep 

1 MAQILLRSRQ WIDHDKVKQ YGLLDFMPCL RQPPLDNFPT VRPASVEARG 

51 KYVERRRQDK DADGFGQRVA NLRRALNVDF QNHVIACRRQ R IHTLRACAV 

101 IVAKYVGV FQ KSFLRDKRLK LFFGNKVIMY AVCFAFTRRA RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQWTPSF 

201 LFADAHILPL LF* 



ml44 .pep 
al44 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml46 / gl46 90.1% identity in 212 aa overlap 



10 20 30 40 50 60 

ml 4 6 . pep MAQILLRSRQWIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 
I i I II I I I I II i II : I I II : II I I I I I t I I II I I II I I II 11111:11111111 
gl4 6 MKQIPLRLLQWIDHDKVEQYGLFDFMPCLRQPPLDKFPTVRPAPFEARGKHVERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml4 6 . pep DADGFGQRVANLRRALNVDFQNHVIACRRQRIHTLRACAVIVAKYVGVFQKSFLRDKRLK 
1:1:1 I I I I I I II I ) I 11 I I I II I I I I I I I I I : I I I I I i I I I : n II I I I : 1 I I 1 I : I 
gl46 DTDSFRQRVANLRRALNVDFQNHVIACRRQRIHALRACAVIVAEYVCVFQKSLLRDKRFK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml4 6 . pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

I I I I I I II I I I I I I I I hlllMIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

gl 4 6 LFFGNKVIMYAVCFAFTRRARRMRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

130 140 150 160 170 180 



190 200 210 

ml46 .pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 
I I I I I II I I I I I II I II hllllllllllll 
gl4 6 GHIFYLYIFQPIVSQRTPYFIFADAHILPLLFX 

190 200 210 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 581>: 

al46.seq 

1 ATGGCGCAAA TCCTCCTCCG CCCGCGCCAA GTCATCATTG ACCACGACAA 

51 AATCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GACGCGCAGC 

151 AAGCACATCG AAAGACGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCATCTCG AACCTGAGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAACCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC TTGTGCCGTA 

301 ATAGTTGCCG AACACGTCCG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGACG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAACGGAC ACCCGGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 582; ORF 146.a>: 

al46.pep 

1 MAQILLRPRQ VIIDHDKIEQ YGLFDFMPCL RQPPLDNFPT VRPASVETRS 

51 KHIERRRQDK DADGFGQRIS NLSRALNVDF QNHVITCRRQ RIHTLRACAV 

101 IVAEHVRVFQ KSLLRDKRLK LFFGNKVIMY AVCFAFTRRT RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPGF 

201 LFADAHILPL LF* 

ml46/al46 90,6% identity in 212 aa overlap 

10 20 30 40 50 60 

ml46.pep MAQILLRSRQWIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 
IMIIII lll:||ll|::||||:||liMlllllllllllllllM:|:|::||tllll 
al46 MAQILLRPRQVIIDHDKIEQYGLFDFMPCLRQPPLDNFPTVRPASVETRSKHIERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 
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ml 4 6 . pep DADGFGQRVANLRRALNVDFQNHVIACRRQRIHTLRACAVIVAKYVGVFQKSFLRDKRLK 
I I I I I I I 1 :: I I I I I I I I I I I I I I : I I I I i I I I I I I i I t I I I :: I i I I I I : I t I I I I I 
al 4 6 DADGFGQRISNLSRALNVDFQNHVITCRRQRIHTLRACAVIVAEHVRVFQKSLLRDKRLK 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml4 6 . pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 
I I I I I I I I I I i I I I I t I I I : I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al4 6 LFFGNKVIMYAVCFAFTRRTRRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 
130 140 150 160 170 180 

190 200 210 

ml4 6.pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 
I I I I I I I I I I t I I I I I I : I I I I I I I I I I I I I I 
al46 GHIFYLYIFQPIVSQRTPGFLFADAHILPLLE^ 

190 200 210 

The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 583>: 

gl47 . seq (partial ) 

1 . .ATGCGACGAG AAGCCAAAAT GGCACAAATC ACACTCAAAC CCATTGTTTT 

51 ATCAATTCTT TTAATCAACA CACCCCTCCT CGCCCAAGCG CATGAAACTG 

101 AGCAATCGGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCGACTT CGGGGCTGCT GCACACTTCG ACCGCCTCCG ACAAAATCAT 

201 CTCCGGCGAT ACTTTGCGCC AAAAAGCCGT CAACTTGGGC GACGCTTTGG 

251 ACGGCGTACC GGGCATCCAC GCTTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC GGGCAGACGG ATTAAAGTAT TGAACCATCA 

351 CGGCGAAACG GGCGATATGG CGGACTTTTC TCCCGATCAC GCCATTATGG 

401 TAGATACCGC CTTGTCGCAA CAGGTTGAAA TCCTGCGCGG GCCGGTTACG 

451 CTCTTGTACA GCTCGGgcaa tgtggccgGG GCTGGtcaat gttgccgatg 

501 gAAAAAtccc ccaaaaAAtg cc. . 

This corresponds to the amino acid sequence <SEQ ED 584; ORF 147.ng>: 

gl47.pep (partial) 

1 ..MRREAKMAQI TLKPIVLSIL LINTPLLAQA HETEQSVGLE TVSWGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDTALSQ QVEILRGPVT 

151 LLYSSGNVAG AGQCCRWKNP PKNA. . 

The following partial DNA sequence was identified in meningitidis <SEQ ID 585>: 

ml47.seq (partial) 

1 . . CCGCATAAAA CTGAGCAATC GGTGGATTTG GAAACGGTCA GCGTCGTCGG 

51 CAAAAGCCGT CCGCGCGCCA CGTCGGGGCT GTTGCACACT TCGACCGCCT 

101 CCGACAAAAT CATCTCCGGC GATACCTTGC GCCAAAAAGC CGTCAACTTG 

151 GGCGACGCTT TAGACGGCGT ACCGGGCATC CACGCTTCGC AATACGGCGG 

201 CGGCGCGTCT GCTCCCGTCA TTCGCGGTCA AACAGGCAGG CGGATTAAAG 

251 TGTTGAACCA TCACGGCGAA ACAGGCGATA TGGCGGATTT TTCGCCCGAT 

301 CACGCCATTA TGGTAGATAC CGCCTTGTCG CAACAGGTCG AAATCCTGCG 

351 CGGGCCGGTT ACGCTCTTGT ACAGCTCGGG CAATGTGGCG GGGCTGGTCG 

401 ATGTTGCCGA TGGCAAAATC CCCGAAAAAA TGCCTGAAAA CGGCGTATCG 

451 GGCGAACTCG GATTGCGTTT GAGCAGCGGC AATCTGGAAA AACTCACGTC 

501 CGGCGGCATC AATATCGGTT TGGGCAAAAA CTTTGTATTG CACACGGAAG 

551 GGCTGTACCG CAAATCGGGG GATTACGCCG TACCGCGTTA CCGCAATCTG 

601 AAACGCCTGC CCGACAGCCA CGCCGATTCG CAAACGGGCA GCATCGGGCT 

651 GTCTTGGGTT GGCGAAAAAG GTTTTATCGG CGTAGCGTAC AGCGACCGTC 

701 GCGACCAATA TGGTCTGCCT GCCCACAGCC ACGAATACGA TGATTGCCAC 

751 GCCGACATCA TCTGGCAAAA GAGCTTGATT AACAAACGCT ATTTACAGCT 

801 TTATCCGCAC CTGTTGACCG AAGAAGACAT CGATTACGAC AATCCGGGCT 

851 TGAGCTGCGG CTTCCACGAC GACGATAATG CACACGCACA CACCCACAGC 

901 GGCAGACCGT GGATAGACCT GCGCAACAAA CGCTACGAAC TCCGTGCCGA 

951 ATGGAAGCAA CCGTTCCCCG GTTTTGAAGC CCTGCGCGTA CACCTGAACC 

1001 GCAACGACTA CCGCCACGAC GAAAAAGCAG GCGATGCAGT CGAAAACTTT 

1051 TTTAACAACC AAACGCAAAA CGCCCGCATC GAGTTGCGCC ACCAACCCAT 
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1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 



AGGTCGTCTG 
GTGCTTTATC 
AACAAAGTGC 
CAACTTCACG 
TTCAGTACGA 
CTGCCCGACC 
GGGCAACTGG 
ATCAGGAACG 
GTCGCCACCA 
TTCCAACAAT 
ACAATCTGGC 
ACCTTAAACG 
GMGCTCGTG 
GCGAAATCTA 
GACTATGTAC 
AGATGCCTAC 
CCCGTGTTCC 
GACCGTATCG 
ACTCGCCCGC 
GCGCAAACTA 
AAAGCCGACA 
CTCTGATACG 
AGTTTTAA 



AAAGGCAGCT 
TGCCATATCC 
AACATTACAG 
CTTGAAGGAG 
CAAAGCATTG 
TCGGCGCGCA 
TATTTCACGC 
CCTGCCGTCA 
ACACCTTTGA 
ATCGAACTCG 
ACTCTACCGC 
ACGGACGCGG 
CGCTACAACC 
CTTCAAACCG 
GAGGCCGTCT 
GGCAACCGTC 
GGCTGCGCGC 
ATGCCAATTT 
TACGAAACGC 
CCGCCGCAAT 
ACCTGCTCAA 
CCGCAAATGG 



GGGGCGTGCA 
GAAGCGGTTA 
CTTTTTCGGT 
GCGTACGCGT 
ATTGATCGGG 
CCGCCAAACC 
CACAACACAA 
ACGCAAGAGC 
AGTCGGCAAC 
CGCTGGGCTA 
AACCGCTTCG 
CCCCAAATCC 
AATCCGGCGC 
ACACCGCGCT 
GAAAAACCTG 
CTTTCATCGC 
CTCGGCTTCC 
GGACTACTAC 
GCACGCCCGG 
ACGCGCTATG 
CCAATCCGTT 
GCCGCAGCTT 



ATATTTACAA 
AACAACCGAT 
GTAGAACAGG 
GGAAAAACAA 
AAAACTACTA 
GCCCGCTCAT 
ACTCAGCCTG 
TGTACGCACA 
AAACACCTCA 
CGAAGGCGAC 
GTAACTACAT 
ATCGAAGACG 
CGACTTCTAC 
ACCGCATCGG 
CCTTCCCTAC 
ACAGGACGAC 
ACCTGAAAGC 
CGCGTGTTCG 
ACACCATATG 
GCGAGTGGAA 
TACGCCCACA 
TACCGGCGGC 



CAAAAATCCA 
GCTGCTTGAC 
CAAACTGGGA 
AAAGCCTCCA 
CAACCACCCC 
TCGCACTTTC 
ACCGCCTCCC 
CGGCAAACAC 
ACAAAGAGCG 
CGCTGGCAAT 
TTACGCCCAA 
ACAGCGAAAT 
GGCGCGGAAG 
CGTTTCCGGC 
CCGGCAGAGA 
CAAAATGCCC 
CTCGCTGACC 
CCCAAAACAA 
CTCAACCTCG 
TTGGTACGTC 
GCAGCTTTCT 
GTGAACGTGA 



This corresponds to the amino acid sequence <SEQ ID 586; ORF 147>: 

ml47.pep (partial) 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 



. PHKTEQSVDL 
GDALDGVPGI 
HAIMVDTALS 
GELGLRLSSG 
KRLPDSHADS 
ADIIWQKSLI 
GRPWIDLRNK 
FNNQTQNARI 
NKVQHYSFFG 
LPDLGAHRQT 
VATNTFEVGN 
TLNDGRGPKS 
DYVRGRLKNL 
DRIDANLDYY 
KADNLLNQSV 



ETVSWGKSR 
HASQYGGGAS 
QQVEILRGP7 
NLEKLTSGGI 
QTGSIGLSWV 
NKRYLQLYPH 
RYELRAEWKQ 
ELRHQPIGRL 
VEQANWDNFT 
ARSFALSGNW 
KHLNKERSNN 
lEDDSEMKLV 
PSLPGREDAY 
RVFAQNKLAR 
YAHSSFLSDT 



PRATSGLLHT 
APVIRGQTGR 
TLLYSSGNVA 
NIGLGKNFVL 
GEKGFIGVAY 
LLTEEDIDYD 
PFPGFEALRV 
KGSWGVQYLQ 
LEGGVRVEKQ 
YFTPQHKLSL 
lELALGYEGD 
RYNQSGADFY 
GNRPFIAQDD 
YETRTPGHHM 
PQMGRSFTGG 



STASDKIISG 
RIKVLNHHGE 
GLVDVADGKI 
HTEGLYRKSG 
SDRRDQYGLP 
NPGLSCGFHD 
HLNRNDYRHD 
QKSSALSAIS 
KASIQYDKAL 
TASHQERLPS 
RWQYNLALYR 
GAEGEIYFKP 
QNAPRVPAAR 
LNLGANYRRN 
VNVKF* 



DTLRQKAVNL 
TGDMADFSPD 
PEKMPENGVS 
DYAVPRYRNL 
AHSHEYDDCH 
DDNAHAHTHS 
EKAGDAVENF 
EAVKQPMLLD 
IDRENYYNHP 
TQELYMGKH 
NRFGNYIYAQ 
TPRYRIGVSG 
LGFHLKASLT 
TRYGEWNWYV 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF fiom ;\^. gonorrhoeae 

ml47 / gl47 92.3% identity in 142 aa overlap 



ml47 .pep 
gl47 

ml47 .pep 
gl47 

ml 4 7 . pep 
gl47 



10 20 30 

PHKTEQSVDLETVSWGKSRPRATSGLLHTS 
I: t I I t I I I I I M I I I M I I I I I I I I I I I 
MRREAKMAQITLKPIVLSILLINTPLLAQAHETEQSVGLETVSWGKSRPRATSGLLHTS 
10 20 30 40 50 60 

40 50 60 70 80 90 

TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
><> I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I j I I j I I I I I I I I I I I I I I I 

TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
70 80 90 100 110 120 

100 110 120 130 140 150 

GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
I > I >> M I I I I I I I I I i I I I I I I I I I I I I t I I I I I I I I I I : I I I 

GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGAGQCCRWKNPPKNA 
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The following partial DNA sequence was identified in meningitidis <SEQ ID 587>: 

al47 . seq 

1 ATGCGACGAG AAGCCAAAAT GGCACAAACT ACACTCAAAC CCATTGTTTT 

51 ATCAATTCTT TTAATCAACA CACCCCTCCT CTCCCAAGCG CATGGAACTG 

101 AGCAATCAGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCCACTT CGGGGCTGCT GCACACTTCT ACCGCCTCCG ACAAAATCAT 

201 CAGCGGCGAC ACCTTGCGAC AAAAAGCCGT CAACTTGGGT GATGCTTTAG 

251 ACGGCGTACC GGGCATTCAT GCCTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC AGGCAGACGG ATTAAAGTGT TGAACCATCA 

351 CGGCGAAACG GGCGACATGG CGGACTTCTC TCCAGACCAT GCAATCATGG 

401 TGGACAGCGC CTTGTCGCAA CAGGTCGAAA TCCTGCGCGG TCCGGTTACG 

451 CTCTTGTACA GCTCGGGCAA TGTGGCGGGG CTGGTCGATG TTGCCGATGG 

501 CAAAATCCCC GAAAAAATGC CTGAAAACGG CGTATCGGGC GAACTCGGAT 

551 TGCGTTTGAG CAGCGGCAAT CTGGAAAAAC TCACGTCCGG CGGCATCAAT 

601 ATCGGTTTGG GCAAAAACTT TGTATTGCAC ACGGAAGGGC TGTACCGCAA 

651 ATCGGGGGAT TACGCCGTAC CGCGTTACCG CAATCTGAAA CGCCTGCCCG 

701 ACAGCCACGC CGATTCGCAA ACGGGCAGCA TCGGGCTGTC TTGGGTTGGC 

751 GAAAAAGGCT TTATCGGCGC AGCATACAGC GACCGTCGCG ACCAATATGG 

801 TCTGCCTGCC CACAGCCACG AATACGATGA TTGCCACGCC GACATCATCT 

851 GGCAAAAGAG TTTGATTAAC AAACGCTATT TGCAGCTTTA TCCGCACCTG 

901 TTGACCGAAG AAGACATCGA TTACGACAAT CCGGGCTTGA GCTGCGGCTT 

951 TCACGACGAC GATGATGCAC ACGCCCATGC CCACAACGGC AAACCTTGGA 

1001 TAGACCTGCG CAACAAACGC TACGAACTCC GCGCCGAATG GAAGCAACCG 

1051 TTCCCCGGTT TTGAAGCCCT GCGCGTACAC CTGAACCGCA ACGACTACCG 

1101 CCACGACGAA AAAGCAGGCG ATGCAGTAGA AAACTTTTTT AACAACCAAA 

1151 CGCAAAACGC CCGTATCGAG TTGCGCCACC AACCCATAGG CCGTCTGAAA 

1201 GGCAGCTGGG GCGTGCAATA TTTGGGACAA AAATCCAGTG CTTTATCTGC 

1251 CACATCCGAA GCGGTCAAAC AACCGATGCT GCTTGACAAT AAAGTGCAAC 

1301 ATTACAGCTT TTTCGGTGTA GAACAGGCAA ACTGGGACAA CTTCACGCTT 

1351 GAAGGCGGCG TACGCGTGGA AAAACAAAAA GCCTCCATCC GCTACGACAA 

1401 AGCATTGATT GATCGGGAAA ACTACTACAA CCATCCCCTG CCCGACCTCG 

1451 GCGCGCACCG CCAAACCGCC CGCTCATTCG CACTTTCGGG CAACTGGTAT 

1501 TTCACGCCAC TVACACTW^CT CAGCCTGACC GCCTCCCATC AGGAACGCCT 

1551 GCCGTCAACG CAAGAGCTGT ACGCACACGG CAAACACGTC GCCACCAACA 

1601 CCTTTGAAGT CGGCAACAAA CACCTCAACA AAGAGCGTTC CAACAATATC 

1651 GAACTCGCGC TGGGCTACGA AGGCGACCGC TGGCAATACA ATCTGGCACT 

1701 CTACCGCAAC CGCTTCGGCA ACTACATTTA CGCCCAAACC TTAAACGACG 

1751 GACGCGGCCC CAAATCCATC GAAGACGACA GCGAAATGAA GCTCGTGCGC 

1801 TACAACCAAT CCGGTGCGGA CTTCTACGGC GCGGAAGGCG AAATCTACTT 

1851 CAAACCGACA CCGCGCTACC GCATCGGCGT TTCCGGCGAC TATGTACGAG 

1901 GCCGTCTGAA AAACCTGCCT TCCCTACCCG GCAGGGAAGA CGCCTACGGC 

1951 AACCGCCCAC TCATTGCCCA AGCCGACCAA AACGCCCCTC GCGTTCCGGC 

2001 TGCGCGCCTC GGCGTCCACC TGAAAGCCTC GCTGACCGAC CGCATCGATG 

2051 CCAATTTGGA CTACTACCGC GTGTTCGCCC AAAACAAACT CGCCCGCTAC 

2101 GAAACGCGCA CGCCCGGACA CCATATGCTC AACCTCGGCG CAAACTACCG 

2151 CCGCAATACG CGCTATGGCG AGTGGAATTG GTACGTCAAA GCCGACAACC 

2201 TGCTCAACCA ATCCGTTTAC GCCCACAGCA GCTTCCTCTC TGATACGCCG 

2251 CAAATGGGCC GCAGCTTTAC CGGCGGCGTG AACGTGAAGT TTTAA 

This corresponds to the amino acid sequence <SEQ ID 588; ORF 147.a>: 

al47 .pep 

1 MRREAKMAQT TLKPIVLSIL LINTPLLSQA HG TEOSVGLE TVSWGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDSALSQ QVEILRGPVT 

151 LLYSSGNVAG LVDVADGKIP EKMPENGVSG ELGLRLSSGN LEKLTSGGIN 

201 IGLGKNFVLH TEGLYRKSGD YAVPRYRNLK RLPDSHADSQ TGSIGLSWVG 

251 EKGFIGAAYS DRRDQYGLPA HSHEYDDCHA DIIWQKSLIN KRYLQLYPHL 

301 LTEEDIDYDN PGLSCGFHDD DDAHAHAHNG KPWIDLRNKR YELRAEWKQP 

351 FPGFEALRVH LNRNDYRHDE KAGDAVENFF NNQTQNARIE LRHQPIGRLK 

401 GSWGVQYLGQ KSSALSATSE AVKQPMLLDN KVQHYSFFGV EQANWDNFTL 

451 EGGVRVEKQK ASIRYDKALI DRENYYNHPL PDLGAHRQTA RSFALSGNWY 

501 FTPQHKLSLT ASHQERLPST QELYAHGKHV ATNTFEVGNK HLNKERSNNI 

551 ELALGYEGDR WQYNLALYRN RFGNYIYAQT LNDGRGPKSI EDDSEMKLVR 
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601 YNQSGADFYG AEGEIYFKPT PRYRIGVSGD YVRGRLKNLP SLPGREDAYG 

651 NRPLIAQADQ NAPRVPAARL GVHLKASLTD RIDANLDYYR VFAQNKLARY 

701 ETRTPGHHML NLGANYRRNT RYGEWNWYVK ADNLLNQSVY AHSSFLSDTP 

751 QMGRSFTGGV NVKF* 



ml47/al47 



98.1% identity in 734 aa overlap 



10 20 30 

ml 4 7 . pep PHKTEQSVDLET VS WGKSRPRATSGLLHTS 

I I I I I I I I I I I I I I I i I i I I I I I I I I i I 
al 4 7 MRREAKMAQTTLKPIVLS ILLINTPLLSQAHGTEQSVGLETVS WGKSRPRATSGLLHTS 

10 20 30 40 50 60 

40 . 50 60 70 . 80 90 

ml47 . pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
M I i I I I I t I I I I I I I I I Tl I I I I I I I I I I I I I I I t t I I I I I I I I I I I I I [ I I I I I i I I I 
al47 TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
70 80 90 100 110 120 

100 110 120 130 140 150 

inl47 . pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
1 I I I I I I I I I I I I I I: I I I I I I I I I I I I t ) I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I 
al47 GDMADFSPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
130 140 150 160 170 180 

160 170 180 190 200 210 

ml 4 7 . pep ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 
I I I I I I I I I I I I I I I I i I I I I I I i I I I I I I I I 1 I I I I M I I I I I I I I I I I I I I I I t I I i I 
al47 ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 
190 200 210 220 230 240 

220 230 240 250 260 270 

ml 4 7 . pep TGS IGLSWVGEKGFIGVAYSDRRDQYGLPAHSHE YDDCHADI IWQKS LINKRYLQLYPHL 

I I I I I i I I I I I I M I I : I I I I f I I I I I I I I I I I I I I I I I I t I I I I I I I I I I t I I I I I I I I 
al47 TGSIGLSWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 
250 260 270 280 290 300 

280 290 300 310 320 330 

ml47 .pep LTEEDIDYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVH 
I I I I I I 1 i I I i i I I I I I I I I I : I I I I : t : I : I I I I I I I I I I [ 1! I I I I I I I I I I I I I I I I 
al47 LTEEDIDYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVH 
310 320 330 340 350 360 

340 350 360 370 380 390 

ml47.pep LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISE 
i I I I I I I I i I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I t I I II 
al 4 7 LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSE 
370 380 390 400 410 420 

400 410 420 430 440 450 

ml47 , pep AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I : I I I I I I I I I I I I i I I I 
al47 AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPL 
430 440 450 460 470 480 



460 470 480 490 500 510 

ml47 . pep PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 
I I I I I M I I I I I I I I I I i I I 1 I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I i 
al47 PDLGAHRQTARSFALSGKWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 
490 500 510 520 530 540 

520 530 540 550 560 570 

ml4 7 . pep HLNKERSNNIELALGYEGDRWQYNLALYRNRFGKYI YAQTLNDGRGPKSIEDDSEMKLVR 

I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

al47 HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR 
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55b 560 570 580 590 600 

580 590 600 610 620 630 

ml47 . pep YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQ 
I I I I I I I I t 1 I I t I I I I I I i I I I I I I I I I I I i M I M I I i I I I i I I I I M I I I : I I I II 
al47 YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQ 
610 620 630 640 650 660 

640 650 660 670 680 690 

ml 4 7 . pep NAPRVPAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT 
I I I II I I II II II I II I I I I II I I II I I I II I I II I i I II II I II I I II I I li II I t I I 
al 4 7 NAPRVPAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT 
670 680 690 700 710 720 

700 710 720 730 

ml 4 7 . pep RYGEWNWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
I I I I I i II I I I I I I I I I I II II I I II I I I I I I I II I I I I I I I t I I 
a 1 4 7 RYGEWNW YVKADNLLNQS V YAHS S FLS DT PQMGRS FTGG VNVKFX 

730 740 750 760 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 589>: 

gl48.seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGctgg ttcaTCCCGA 

51 AgctATgagt gtcggcgCGC TTGccgAcaa AATCCGCAAA AtcgaAAact 

101 gGCCGCAAAA AGgcaTCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGT 

151 GCGGAATACT TCCGCCTTTT GGTCGATTTG CTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGCTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAaCGtcg gctTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTAcg cgcTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGa tgccgTCAAA CCCGGTTCGC 

401 GCGTCCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC AATGCTTGCC 

451 GGGCTGGAAC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAgccgccgC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGCGCAAGTG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGCAT GAAAGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 590; ORF 148.ng>: 

gl48 .pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK lENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK PGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 591>: 

ml48 . seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGCGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTTCAAAGC 

151 GCGGAATACT TCCGCCTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

401 GCGTGCTGCT GGTCGATGAT TTGATTGCCA CGGGCGGCAC GATGCTTGCC 

451 GGACTGGAAC TGATCCGCAA ACTCGGCGGA GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 592; ORF 148>: 

ml48 .pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK lENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 
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101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LIATGGTMLA 
151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. ^gonorrhoeae 

ml48 / gl48 99.0% identity in 199 aa overlap 

10 20 30 40 50 60 

MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 

I I I I I I I I I I ) I I i I I I I t I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I t 

MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 
10 20 30 40 50 60 

70 80 90 100 110 120 

LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
i I i I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I t M I I I I t I I I I I I I I I I I I I I I 
LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
70 80 90 100 110 120 

130 140 150 160 170 180 

AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 
M I I I I I I I I I I I I I I I I I I : I I I I i I I I I I I I I I i I i I I I M I i I I I I I I I I I I I I I I 
AVEIHTDAVKPGSRVLLVDDLVATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 
130 140 150 160 170 180 

190 . 200 

RASGAPLFTLLQNEGCMKGX 
I M I I I I I I I I I I I I I I I I I 
RASGAPLFTLLQNEGCMKGX 
190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 593>; 

al48 . seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGTGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGC 

151 GCGGAATACT TCCGACTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

401 GCGTGCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC GATGCTTGCC 

451 GGACTGGAGC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 594; ORF 148.a>: 

al48 .pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK lENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

ml48/al48 99.5% identity in 199 aa overlap 

10 20 30 40 50 60 

ml48 . pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 
I I I I t I i t I I I I { I i I I I I I I I I 1 i I I I I I I I I i I I I I I I I I I I I I i I I I I 1 I I I I I M I 
al48 MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 

10 20 30 40 50 60 

70 80 90 100 110 120 



ml48.pep 
gl48 

ml48 .pep 
gl48 

ml48 .pep 
gl48 

ml48 .pep 
gl48 
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ml48 . pep LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
t I M M ) I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I t I I I I I I 
al4 8 LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
70 80 90 100 110 120 

130 ■ 140 150 160 170 180 

inl4 8.pep AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 
I N I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I [ i I I I j I I I I I i I I I I I I I I I I 
al48 AVEIHTDAVKLGSRVLLVDDLVATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 

130 140 150 160 170 180 

190 200 
ml 4 8 . pep RASGAPLFTLLQNEGCMKGX 
I I I I I I I I I I I I I M I I I I I 
al48 RASGAPLFTLLQNEGCMKGX 

190 200 



The following partial DNA sequence was identified in ^gonorrhoeae <SEQ ID 595>: 
gl49. seq 

1 ATGTTGATTG ACAACAATGT CCGCCATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAATTGG GACAACTTCA CGCTTGAAGG CX3GCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGGTAC GACAAAGCAT TGATTGATCG AGAAAACTAC 

151 TACAACCAGC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 GTTCGCACTT TCGGGCAACT GGTATTTCAC GCCACACCAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAa cgCCTGCCGT CAACGCaagA actGtACgca 

301 cacggcAAGC ACGtcgccac CAACACCTTT GAagtcggca acaaACACCT 

351 CAACAAAGaG CgttccaacA atatcgaACT . CGCGCTGGgc tAcaaaggcg 

401 accGCTGGCA ATACAATCTG GCAGCCTACC GCAACCGAtT CGGCAACTAC 

451 ATTTACGCCC AAACCTTAaa cgacggacgC GGCCCCAAAT CCATCgaaga 

501 cgacagcgaA ATGaagcTCG TGCGCTACAA CCAATCCGGT GCCGACTTCT 

551 ACGgcgcggA aggcgaaatc tACTTcaaaC CGAcACCGCG CTACCGCATC 

601 6GTGTTTCCG GCGACTatgt acgaggccgT CTGAAAAACC TGCCGTCCCT 

651 ACCCGGCAGG gaagatccCT AcggcAAACG TCccttcaTC GCACAAGCCG 

701 ACCAAAACGC CCCCCGCATT ccggctGCGC GCCTCGGCTT CCACCTGAAA 

751 ACCTCGCTAA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGTACGCCC GGACACCATA 

851 TGCTCAACCT CGGTGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCtC aACcaatCcg tTTACGCCCa 

951 CAGCAGCTTC CTCTCTGATA CGCCGCAAAt gGGCCGCAGC TTtgccgGCg 

1001 gcgtaAACGT GaAGTTttaA 

This corresponds to the amino acid sequence <SEQ ID 596; ORF 149.ng>: 
gl49 .pep 

1 MLIDNNVRHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNQPLPDLGA HRQTARSFAL SGNWYFTPHH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YKGDRWQYNL AAYRNRFGNY 

151 lYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDPYGKRPFI AQADQNAPRI PAARIiGFHLK 

251 TSLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRENTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FAGGVNVKF* 

The following partial DNA sequence was identified in Kmeningitidis <SEQ ID 597>: 

ml49,seq 

1 ATGCTGCTTG ACAACAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG AGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATTCAGTAC GACAAAGCAT TGATTGATCG GGAAAACTAC 

151 TACAACCACC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 

351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG 

401 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGTAACTAC 
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451 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA 

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGC GCCGACTTCT 

551 ACGGCGCQGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

601 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGA GAAGATGCCT ACGGCAACCG TCCTTTCATC GCACAGGACG 

701 ACCAAAATGC CCCCCGTGTT CCGGCTGCGC GCCTCGGCTT CCACCTGAAA 

751 GCCTCGCTGA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 

951 CAGCAGCTTT CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGTTTTAA 

This corresponds to the amino acid sequence <SEQ K) 598; ORF 149>: 
ml49.pep 

1 MLIiDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIQY DKALIDRENY 

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 lYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDAYGNRPFI AQDDQNAPRV PAARLGFHLK 

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF fix>m K^onorrhoeae 

ORF 149 shows 95.9% identity over a 339 aa overlap with a predicted ORF (ORF 149.ng) 
from iV. gonorrhoeae: 

ml49/gl49 

10 20 30 40 50 60 

MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENY YNHPLPDLGA 

Ihlhhillllllllllllllllllllllllllllhllllllllllllhlllllil 

ML IDNNVRHYS FFGVEQANWDNFTLEGGVRVEKQKAS I RYDKALIDRENYYNQPLPDLGA 
10 20 30 40 50 60 

70 80 90 100 110 120 

HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

lllllllllllillMlhlMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

HRQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 
70 80 90 100 110 120 

130 140 150 160 170 180 

RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 

IIMIIIIIIiHIIIIIIII llllllllllllllllllllllilllllllllllllll 

RSNNI ELALGYKGDRWQ YNLAAYRNRFGNYI YAQTLNDGRGPKS I EDDS EMKLVRYNQSG 
130 140 150 160 170 180 

190 200 210 220 230 240 

ADFYGAEGE I YFKPTPRYRI GVSGD YVRGRLKNLPS LPGREDAYGNRPFI AQDDQNAPRV 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii ihiiiiii mill: 

ADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRI 
190 200 210 220 230 240 

250 260 270 280 290 300 

PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 

llllilllihIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

PAARLGFHLKTSLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 
250 260 270 280 290 300 

310 320 330 340 

NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

MIIIIIIIIIIIIIIIIIMIIIIIIIII|:|IMIIII 



ml49 .pep 
gi49 

tnl4 9 .pep 
gl49 

ml49 .pep 
gi49 

Tnl49 .pep 
gl49 

tnl49 .pep 
gl49 

ml49 .pep 
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gl4 9 NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFAGGVNVKFX 
310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 599>: 

al49. seq 

1 ATGCTGCTTG AC/^TAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGCTAC GACAAAGCAT TGATTGATCG GGAAAACTAC 

151 TACAACCATC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 

351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG 

401 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGCAACTAC 

451 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA 

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGT GCGGACTTCT 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

601 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGG GAAGACGCCT ACGGCAACCG CCCACTCATT GCCCAAGCCG 

701 ACCAAAACGC CCCTCGCGTT CCGGCTGCGC GCCTCGGCGT CCACCTGAAA 

751 GCCTCGCTGA CCGACCGCAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

851 TGCTC7VACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 

951 CAGCAGCTTC CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 600; ORF 149.a>: 

al4 9.pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 lYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDAYGNRPLI AQADQNAPRV PAARLGVHLK 

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

ml49/al49 98.8% identity in 339 aa overlap 

10 20 30 40 50 60 

ml 4 9 . pep MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENY YNHPLPDLGA 

I I I I I I I t I I I I I I I i I I i I I I I I 1 I I j I I I I I I I I I I: I I I I I I I I I I I I I I I I I I I I I 
al4 9 MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPLPDLGA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 9 . pep HRQTARSFAL SGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

liMMIMIIIIIIMIIIIIIIIitlllllllllllllllllllllllllllllllll 
al49 HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 9. pep RSNNXELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 
I I I I I I i I I M I I I I I t I I I I I I I I I M I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al4 9 RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 9 . pep ADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRV 
I I M I I I t I I I I I I I I I I I I I I I I I M I I I M I I I I I I I ) I I I I I I I I : I I I I I I I I I i 
al4 9 ADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQNAPRV 

190 200 210 220 230 240 



ml4 9.pep 



250 260 270 280 290 300 

PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 



m 
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M I i I I I t I I I I I I M t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I 
al4 9 PAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 
250 260 270 280 290 300 

310 320 330 340 

ml 4 9 . pep NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 
al4 9 NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

310 320 330 340 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 601>: 

gl4d-l.seq 

1 ATGGCACAAA TCACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCGAC TTCGGGGCTG 

151 CTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACTTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT GGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACGGGCAGAC GGATTAAAGT ATTGAACCAT CACGGCGAAA CGGGCGATAT 

351 GGCGGACTTT TCTCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTTGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGAAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGaagccgG ATTGCGTTTG AGCAGCGGCA 

551 ATTTAGAAAA ACTGACATCC GCAGGCATCA ATATCGGACT GGGCAAAAAC 

601 TTCGTGCTGC ATACCGAAGG CTTGTACCGC AAATCGGGCG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAT GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTGG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCGCTAC GGCCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATCA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACT TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG GCGACGGTGC 

951 ACACGCACAC ACCCACAACG GCAAACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAGC CATTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ATCTGAACCG CAATGACTAC CACCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTCT TCAACAACAA AACACACAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG CGCGCTTTCC GCCATTCCCG AAACCGTCCA 

1251 ACAACCGATG TTGATTGACA ACAATGTCCG CCATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAATTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGGTACGAC AAAGCATTGA TTGATCGAGA 

1401 AAACTACTAC AACCAGCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCGTT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACACCACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAACT 

1551 GTACGCACAC GGCAAGCACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA GCCTACCGCA ACCGATTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGT GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CGTCCCTACC CGGCAGGGAA GATCCCTACG GCAAACGTCC CTTCATCGCA 

1951 CAAGCCGACC AAAACGCCCC CCGCATTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAACC TCGCTAACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG TACGCCCGGA 

2101 CACCATATGC TCAACCTCGG TGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TAAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 602; ORF 149-l.ng>: 

gl49-l.pep 

1 MAQITLKPIV LSILLINTPL LAQAH ETEQS VGLETVSWG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGEAGLRL SSGNLEKLTS AGINIGLGKN 

201 FVLHTEGLVR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYSDRRDRY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDGDGAHAH THNGKPWIDL RNKRYELRAE WKQPFPGFEA 



wo 99/57280 PCT/US99/09346 

422 



351 LRVHLNRNDY HHDEKAGDAV ENFFNNKTHN ARIELRHQPI GRLKGSWGVQ 

401 YLGQKSSALS AIPETVQQPM LIDNNVRHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIRYD KALIDRENYY NQPLPDLGAH RQTARSFALS GNWYFTPHHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 

551 EGDRWQYNLA AYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DPYGKRPFIA 

651 QADQNAPRIP AARLGFHLKT SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 603>: 

ml49-l . seq 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGATTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC GTCGGGGCTG 

151 TTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACCTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT* AGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCGTCTG CTCCCGTCAT TCGCGGTCAA 

301 ACAGGCAGGC GGATTAAAGT GTTGAACCAT CACGGCGAAA CAGGCGATAT 

351 GGCGGATTTT TCGCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTCGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG TTTTATCGGC 

751 GTAGCGTACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGCTTGATTA 

851 ACT^CGCTA TTTACAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG ACGATAATGC 

951 ACACGCACAC ACCCACAGCG GCAGACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGTGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 

1101 CGATGCAGTC GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGCATCG 

1151 AGTTGCGCCA CCAACCCATA GGTCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTACAAC AAAAATCCAG TGCTTTATCT GCCATATCCG AAGCGGTTAA 

1251 ACAACCGATG CTGCTTGACA ACAAAGTGCA ACATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGAGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT TCAGTACGAC AAAGCATTGA TTGATCGGGA 

1401 717ACTACTAC AACCACCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

1701 TAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGCGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CTTCCCTACC CGGCAGAGAA GATGCCTACG GCAACCGTCC TTTCATCGCA 

1951 CAGGACGACC AAAATGCCCC CCGTGTTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAGCC TCGCTGACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 

2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTTCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 604; ORF 149-1>: 

ml49-l.pep 

1 MAQTTLKPIV LSILLINTPL LAQA HETEQS VDLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 VAYSDRRDQY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDNAHAH THSGRPWIDL RNKRYELRAE WKQPFPGFEA 

351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

401 YLQQKSSALS AISEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIQYD £CALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 
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551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPFIA 

651 QDDQNAPRVP AARLGFHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

ml49-X/gl49-l 96.2% identity in 758 aa overlap 

10 20 30 40 50 60 

ml4 9-l.pep MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 
Ml IIIIIIIIKIIIIIIIIIIillllll I I I I I I I I I I I I I I I I I i I I I I I I I I I I 
gl49-l MAQITLKPIVLSILLINTPLLAQAHETEQSVGLETVSWGKSRPRATSGLLHTSTASDKI 

10 20 30 40 50 60 

70 80 90 100 110 120 

inl49-l.pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
M I I I I I I I I I I I I I I I I I I I I I I I I I I i I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl49-l ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml4 9 - 1 . pep S PDHAIMVDTALSQQVEI LRGPVTLLYS SGNVAGLVDVADGKI PEKMPENGVSGELGLRL 
MM IIIIIMIIIIIMIIIIIIIIIIIilllllltllllllllllll nil 

gl4 9-l S PDHAIMVDTALSQQVEI LRGPVTLLYS SGNVAGLVDVADGKIPEKMPENGVSGEAGLRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 9-1. pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

I II I I I II II : II III M I It I I I I I I I I I I I I t 1 I II I I I I I II I I I I I 1 I I I I I II II 
gl4 9-l SSGNLEKLTSAGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 4 9-1. pep SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 

II I li II II i • II i II II : II I I II I I I I II I II I II I II I I II II II I I I I I I I II I II 
gl 4 9-1 SWVGEKGFIGAAYSDRRDRYGLPAHSHEYDDCHADI IWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 9-1 . pep DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
I I I I I I I II I I I I I : I I I I I I : I : I I i I I II I I I I I I I I I I I I I I I I I I t I I I I I I I I I 
gl49-l DYDNPGLSCGFHDGDGAHAHTHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 4 9 - 1 . pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 
:|lllllllllllll|:|:||||||||||||||||||||||| l||||||tl |:|:||| 
gl49-l HHDEKAGDAVENFFNNKTHNARIELRHQPIGEILKGSWGVQYLGQKSSALSAIPETVQQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 4 9-1 . pep LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGAH 
I:||:|:|||llllllllltllltlll||||||||||:|||||||||||i|:)||||||l 
g 14 9- 1 LIDNNVRHYS FFGVEQANWDNFTLEGGVRVEKQKAS IRYDKALI DRENYYNQPLPDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml4 9-1 . pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 
tllllllllilllllll:|llllllillllllllllll|l||l|||||||il|||||||| 
gl49-l RQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml49-l . pep SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
M I I I I I I I I I I I I i I I I I I I I I f I I 1 I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
gl49-l SNN I ELALGYEGDRWQYNLAAYRNRFGNY I YAQTLNDGRG PKS I EDDSEMKLVRYNQSGA 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml49-l , pep DFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRVP 
IMIIilllMlllinililllllllllllllllllllll 11:111111 llllll:| 
gl49-l DFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRIP 

610 620 630 640 650 660 
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670 680 690 700 710 720 

ml49-l.pep AARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 

lilllll)|:lllllllillillllllll> Illlllllllllllllilllilll 

gl49-l AARLGFHLKTSLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 
670 680 690 700 710 720 

730 740 750 759 

ml49-l .pep WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

IIIMIIIIMIIIMIIMIIilll Illlllil 

gl49-l WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

730 740 750 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 605>: 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCTCCCAAG CGCATGGAAC TGAGCAATCA GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC TTCGGGGCTG 

151 CTGCACACTT CTACCGCCTC CGACAAAATC ATCAGCGGCG ACACCTTGCG 

201 ACAAAAAGCC GTCAACTTGG GTGATGCTTT AGACGGCGTA CCGGGCATTC 

251 ATGCCTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACAGGCAGAC GGATTTiAAGT GTTGAACCAT CACGGCG7JA CGGGCGACAT 

351 GGCGGACTTC TCTCCAGACC ATGCAATCAT GGTGGACAGC GCCTTGTCGC 

401 TUVCAGGTCGA AATCCTGCGC GGTCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA, 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATTA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTTCACGACG ACGATGATGC 

951 ACACGCCCAT GCCCACAACG GCAAACCTTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG T^AAAAGCAGG 

1101 CGATGCAGTA GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG TGCTTTATCT GCCACATCCG AAGCGGTCAA 

1251 ACAACCGATG CTGCTTGACA ATAAAGTGCA ACATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGCTACGAC AAAGCATTGA TTGATCGGGA 

1401 AAACTACTAC AACCATCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCG 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CTTCCCTACC CGGCAGGGAA GACGCCTACG GCAACCGCCC ACTCATTGCC 

1951 CAAGCCGACC AAAACGCCCC TCGCGTTCCG GCTGCGCGCC TCGGCGTCCA 

2001 CCTGAAAGCC TCGCTGACCG ACCGCATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 

2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 606; ORF 149-l.a>: 

al49-l.pep 

1 MAQTTLKPIV LSILLINTPL LSQAH GTEQS VGLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDS ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYSDRRDQY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDDAHAH AHNGKPWIDL RNKRYELRAE WKQPFPGFEA 
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351 LRVHLNRNDY RHDEKAGDAV 

401 YLGQKSSALS ATSEAVKQPM 

451 EKQKASIRYD KALIDRENYY 

501 LSLTASHQER LPSTQELYAH 

551 EGDRWQYNLA LYRNRFGNYI 

601 DFYGAEGEIY FKPTPRYRIG 

651 QADQNAPRVP AARLGVHLKA 

701 HHMLNLGANY RRNTRYGEWN 

751 TGGVNVKF* 



ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 
LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 
NHPLPDLGAH RQTARSFALS GNWYFTPQHK 
GKHVATNTFE VGNKHLNKER SNNIELALGY 
YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 
VSGDYVRGRL KNLPSLPGRE DAYGNRPLIA 
SLTDRIDANL DYYRVFAQNK LARYETRTPG 
WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 



al49-l/ml49-l 98.0% identity in 758 aa overlap 



10 20 30 40 50 60 

al 4 9- 1 . pep MAQTTLKPI VLSILLINTPLLSQAHGTEQSVGLETVSWGKSRPRATSGLLHTSTASDKI 

ltttlll[IIIIIIIIIIIII:||| llltl lllllllllllllllll Mint 

ml 4 9-1 MAQTTLKPI VLS I LLINTPLLAQAHETEQSVDLETVSWGKSRPRATSGLLHTSTASDKI 

10 20 . 30 40 50 60 

70 80 90 100 110 120 

al49-l.pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
Mllllllllllilllllllllllllttlllllllittllllllllillllilllllill 
ml 4 9- 1 I SGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

al 4 9 - 1 . pep SPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKI PEKMPENGVSGELGLRL 

IIIIMIII: lit Illlllllilllllllljltlillllllllllllll 

ml 4 9-1 SPDHAIMVDTALSQQVE I LRGPVTLLYSSGNVAGLVDVADGKI PEKMPENGVSGELGLRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

al 4 9-1. pep SSGNLEKLTSGGINIGLGKKFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

Illtlll Illllllllllllllllllll Illllllllll 

ml49-l SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

al 4 9-1. pep SWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 

IMIIIMII-llllllllllll Illllllllll Iltllllll 

ral49-l SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

al4 9-1 . pep DYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
IIIIMIIIIIIIII-llit:|:l:IIMIIIIIIIIIIIIIIIIIIIIIIIIII)ltll 
ml 4 9- 1 DYDNPGLSCGFHDDDNAHAHTHSGRPWI DLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

al49-l.pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKS SALS ATSEAVKQPM 

lllllilllltiltllllllllll II II I II III II I III I t 

ml49-l RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

al49-l.pep LLDNKVQHYS FFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNH PL PDLGAH 

IMIIIMIillllllll Illllllhllllllllllllllllllllli 

ml 4 9-1 LLDNKVQHYS FFGVEQANWDNFTLEGGVRVEKQKAS I QYDKALIDRENYYNH PL PDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

al49-l.pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

N I I I I I I I I I I II I lllllllllllllllll I I I I I 11 I I 

ml49-l RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

al49-l.pep SNNIELALGYEGDRWQYNLALYRNRFGNY I YAQTLNDGRGPKS I EDDS EMKLVRYNQSGA 

Illlllllllllll I Illlllllllllll 

ml49-l SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
550 560 570 580 590 600 



610 620 630 640 650 660 



wo 99/57280 



PCTAJS99/09346 



426 



al49-l.pep DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQNAPRVP 
IMIIIMIMIIIIIilltllllllllllllllll||]tlll|||i:||| llllllll 
ml49-l DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRVP 
610 620 630 640 650 660 

670 680 690 700 710 720 

al49-l.pep AARLGVHLKASLTDRI DANLDYYRVFAQNKLARYETRTPGHHMLNLGAN YRRNTRYGEWN 

mil iiiiiiiiiniiitiiiiiiiiiiiiiiMiiMii iiiiiiiiiii 

ml 4 9-1 AARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGAN YRRNTRYGEWN 

670 680 690 700 710 720 

730 740 750 759 

al 4 9-1 .pep WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
i II I I I II I I II I I I M I I I I I I I I I I t I I I tl I I I I I I 
ml 4 9-1 WYVKADNLLNQSVYAHSSFLSDTPC^GRSFTGGVNVKFX 

730 740 750 

The following partial DNA sequence was identified in ^gonorrhoeae <SEQ ID 607>: 

glSO.seq (partial) 

1 . . TACTGCAAGG CAGACCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT 

51 CACCGCCCGC CAATCCGATA AAGACGTGCG CCACATCX3AA ATCGATTTGA 

101 GCGGTTCGGA TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT 

151 GACAACGATC CGGCACTGGT CGGGGAAATC CTAGACCTGC TCGGCATCAA 

201 TCCGGCAACG GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG 

251 CACTGTTATC CCATTTCGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA 

301 GGCTATGCCA CGTTCGCCGA TAATGACGAA CTCGACCGTA TTGCTGCCGA 

351 CAACGCCGTT TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGGTGTGC 

401 TGCACCGCTT CCCGGCAAAA CTGACGGCGG AACAATTCGC CGGCCTGCTG 

451 CGCCCGCTTG CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGC 

501 GGGGGACGAA GTGCACCTGA CCGTCGGCGC AGTGCGTTTC GAACACGAAG 

551 GGCGCGCCAG GGCGGGCGGC GCATCGGGTT TCTTTGCCGA CCGGCTGGAA 

601 GAGGACGGCA CGGTGCGCGT GTTTGCGGAA CGCAACGACG GCTTCAGGCT 

651 GCCCGAAGAC AGCCGCAAGC CGATTGTGAT GATCGGCTCC GGTACCGGCG 

701 TCGCACCGTT CCGCGCCTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA 

751 GGCAGAAACT GGCTGATTTT CGGCAATCCG CATTTTGCCG CCGACTTCCT 

801 CTATCAGACC GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGAT 

851 ATGACTTCGC CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC 

901 AAAATCCGCG AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC 

951 GCATATCTAT GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GAAGTGGAAG 

1001 CCGCCTTGCT GGATGTGATT ATCGGGGCAG GGCATTCGGA CGAAGACGGC 

1051 GCAGAAGGAT ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA 

1101 TGTTTATTGA 

This coiresponds to the amino acid sequence <SEQ ID 608; ORF 150.ng>: 

gl50 .pep (partial) 

1 ..YCKADPFPAA LLANQKITAR QSDKDVRHIE IDLSGSDLHY LPGDALGVWF 

51 DNDPALVGEI LDLLGINPAT EIQAGGKTLP VASALLSHFE liTQNTPAFVK 

101 GYATFADNDE LDRIAADNAV LQGFVQSTPI AGVLHRFPAK LTAEQFAGLL 

151 RPIiAPRLYSI SSSQMAGDE VHLTVGAVRF EHEGRARAGG ASGFFADRLE 

201 EDGTVRVFAE RNDGFRLPED SRKPIVMIGS GTGVAPFRAF VQQRAAENAE 

251 GRNWLIFGNP HFAADFLYQT EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD 

301 KIREQAEGLW QWLQEGAHIY VCGDAAKMAK EVEAALLDVI IGAGHSDEDG 

351 AEGYLDMLRE EKRYQRDVY* 

The following partial DNA sequence was identified in N .meningitidis <SEQ ID 609>: 

ml50 . seq 

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT CTGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AGCAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGGCGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AATGCGAAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAACATCG CCGGCGAACG CGGCCTGCTG CTGGTTACCT CCACCCAAGG 

351 CGAAGGCGAA CCGCCGAAAG AAGCCGTCGT GCTGCACAAA CTGCTGAACG 

401 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 
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451 GGCGACAGTT CCTATCCGAA TTTCTGTCAG GCAGGTAAAG ATTTCGACCG 

501 GCGTTTTGAA GAATTGGGCG CAAAACGGCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TACCGCCTCC GCAAACGCCT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCX3CAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGC7AGG 

701 CAGCCCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACX3CGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCXK3CATCGA TCCGGCAACG 

901 GAAATACAGG CGGGCGGAAA GATGATGCCG GTTGCGCGCG CACTTTCATC 

951 TCATTTCGAA CTCACGCAAA ACACTCCGGC TTTCGTCAAA GGCTATGCCG 

1001 CGTTCGCCCA TTATGAAGAA CTCGATAAAA TCATTGCCGA TAACGCCGTT 

1051 TTGCAGGATT TCGTGCAAAA CACGCCTATT GTCGATGTGC TGCACCGCTT 

1101 CCCGGCAAGC CTGACGGCAG AACAATTCAT CCGTTTACTG CX3TCCGCTTG 

1151 CACCCCGTTT GTATTCGATT TCTTCAGCAC AGGCGGAAGT GGGCGATGAA 

1201 GTGCATTTAA CTGTCGGCGT GGTTCGTTTT GAACACGAAG GCCGCGCCAG 

1251 AACGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

1301 CGGTGCGCGT GTTTGTGGAA CGCMCGACG GCTTCAGGCT GCCCGAAGAC 

1351 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

1401 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 

1451 GGCTGATTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC 

1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGGT ACGATTTCGC 

1551 CTGGTCCCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AT^TCCGCG 

1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 

1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 

1701 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 

1751 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 610; ORF 150>: 



ml50 .pep 




1 


MQNTNPPLPP 


51 


TALPAAEPFS 


101 


KNIAGERRLL 


151 


GDSSYPNFCQ 


201 


LLKEEAAKNR 


251 


QSDKDVRHIE 


301 


EIQAGGKMMP 


351 


LQDFVQNTPI 


401 


VHLTVGWRF 


451 


SRKPIVMIGS 


501 


EWQQFAKDGF 


551 


VCGDAAKMAK 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 150 shows 91.3% identity over a 369 aa overlap with a predicted ORF (ORF ISO.ng) 
from N, gonorrhoeae: 

mlSO/glSO 

210 220 230 240 250 260 

ml50 . pep LLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAALLANQKITARQSDKDVRHIE 

I I I I I I I I I I I I I I I I I M i M I I I I I I I 
gl50 YCKADPFPAALLANQKITARQSDKDVRHIE 

10 20 30 

270 280 290 300 310 320 

ml50 . pep IDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPATEIQAGGKMMPVARALSSHFE 
IMIIIIIIIIIIIIilllMIMIII IIMf IM:||||||j||| M Mil 

gl50 IDLSGSDLHYLPGDALGVWFDNDPALVGEILDLLGINPATEIQAGGKTLPVASALLSHFE 
40 50 60 70 80 90 

330 340 350 360 370 380 

ml50 . pep LTQNTPAFVKGYAAFAHYEELDKIIADNAVLQDFVQNTPIVDVLHRFPASLTAEQFIRLL 
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lllltlllllll|:|| :|||:| lllllll II 

gl50 LTQNTPAFVKGYATFADNDELDRIAADNAVLQGFVQSTPIAGVLHRFPAKLTAEQFAGLL 
100 110 120 130 140 150 

390 400 410 420 430 440 

mlSO . pep RPLAPRLYSISSAQT^EVGDEVHLTVGWRFEHEGRARTGGASGFLADRLEEDGTVRVFVE 

I I I I I I I I I I I I : i I I : i I I I I I I I I : I I I I I ] I I I I : I I I I I I : I I I I I I I I ) I I I t : I 
g 1 5 0 RPLAPRL YS I S S SQAEAGDE VHLT VGAVRFEHEGRARAGGASG FFADRLEE DGTVRV FAE 

160 170 180 190 200 210 

450 460 470 480 490 500 

ml50 . pep RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAEGKNWLIFGNPHFARDFLYQT 
llllllllllllllllttlllllllllllllllllllll]i:|||lllilill Mini 
gl50 RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAEN7VEGRNWLIFGNPHFAADFLYQT 
220 230 240 250 260 270 

510 520 530 540 550 560 

ml50 . pep EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMT^ 

II I I I I I II I I I I I I I I I I I I I I I i I I I I I I t I I M I I I II II I I t I I I I I I I I I I t II I 
gl50 EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMAK 

280 290 300 310 320 330 

570 580 590 600 

ml50 . pep DVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 
: I I I M I I I I I I I I I 11:111 I II I I I t I I I I I I I I I I 
gl50 EVEAALLDVIIGAGHSDEDGAEGYLDMLREEKRYQRDVYX 

340 350 360 370 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 61 1>: 

also . seq 

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT ATGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AGCAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGACGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AATGCGAAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAACATCG CCGGCGAACG CCGCCTGCTG CTGGTTACCT CCACCCAAGG 

351 CGAAGGCGAA CCGCCGGAAG AAGCCGTCGT GCTGCACAAA CTGCTGAACG 

401 GCAAAAAAGC CCCGATVATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 

451 GGCGACAGCT CCTATCCGAA TTTCTGCCGG GCGGGCAAAG ATTTCGACAA 

501 ACGTTTTGAA GAATTGGGCG CAAAACGCCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TGCCGCCGCC GCAGACGGAT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

701 CAGACCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGA/^T CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCAGGCAACG 

901 GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG CACTGTTATC 

951 CCATTTTGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA GGCTATGCCC 

1001 CGTTCGCCGA TGATGACGAA CTCGACCGTA TTGCTGCCGA CAACGCCGTT 

1051 TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGATGTGC TGCACCGCTT 

1101 CCCGGCAAAA CTGACAGCGG AACAATTCGC CGGCCTACTG CGCCCGCTTG 

1151 CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGT GGGGGACGAA 

1201 GTGCACCTGA CCGTCGGCGC GGTGCGTTTC GAACACGAAG GGCGCGCCAG 

1251 GGCGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

1301 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC 

1351 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

1401 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 

1451 GGCTGTTTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCATVACC 

1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGAT ACGATTTCGC 

1551 CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 

1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 

1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
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1701 GGATGTGATT ATCGGGGCAG GACATTTGGA CG/^GAGGGC GCAGAAGAAT 
1751 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 



This corresponds to the amino acid sequence <SEQ ID 612; ORF 150.a>: 

also. pep 
1 
51 

KNIAGERRLL LVTSTQGEGE PPEEAWLHK LLNGKKAPKL 



101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



MQNTNPPLPP MPPEITQLLS GLDAAQWAWL SGYAWAKAGN 
TALPTAEPFS VTVLSASQTG NAKSVADKAA DSLEAAGIQV 



GDSSYPNFCR AGKDFDKRFE ELGAKRLLER VDADLDFAAA 
LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR YCKADPFPAA 
QSDKDVRHIE IDLSGSDLHY LPGDALGVWF DNDPALVREI 

LTQNTPAFVK GYAPFADDDE 
LTAEQFAGLL RPLAPRLYSI 
VHLTVGAVRF EHEGRARAGG ASGFLADRLE EDGTVRVFVE 
SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLFFGNP 
EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD KIREQAEGLW 
VCGDAAKMAK DVEAALLDVI IGAGHLDEEG AEEYLDMLRE 



EIQAGGKTLP VASALLSHFE 
LQGFVQSTPI ADVLHRFPAK 



GASAGLPALQ 
SRAELKDYKA 
DKLQFAVLGL 
ADGWTDNIAA 
LLANQKITAR 
LDLLGIDQAT 
LDRIAADNAV 
SSSQAEVGDE 
RNDGFRLPED 
HFARDFLYQT 
QWLQEGAHIY 
EKRYQRDVY* 



m 1 50/a 1 50 94. 8% identity in 599 aa overlap 

10 

ml50 .pep 



20 



30 



40 



50 



60 



MQNTNPPLPPLPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPAAEPFS 
IIMII)lll:|||||||l|||IMIIIIIIIIIIIIII|||]ill|]||||ii:|||M 
a 1 5 0 MQNTNPPLPPMPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPTAEPFS 
10 20 30 40 50 60 

70 80 90 100 110 120 

mlSO . pep VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGERRLLLVTSTQGEGE 

I I M 1 t I t I I M I I t I I I I I M I I I I I I I I I I ! M I I I n M I I I I I I I I I I I [ 

also VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGERRLLLVTSTQGEGE 

70 80 90 100 110 120 



130 140 150 160 170 180 

mlSO.pep PPKEAWLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCQAGKDFDRRFEELGAKRLLER 

ll-llllllllilllll||||||ltl|||iil|||||||:||||||:|||||||i 

also PPEEAVVLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCRAGKDFDKRFEELGAKRLLER 

130 140 150 160 170 180 

190 200 210 220 230 240 

mlSO . pep VDADLDFTASANAWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAA 
NMIM:|:|::il||illlllillllll|||||||||||||||tllllllll ||||| 
al5 0 VDADLDFAAAADGWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKADPFPAA 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 50. pep LLANQKITARQSDKDVRHIEIDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPAT 

MIMIMIIIIIIIIIIIillllllllMlllMtlMMtlllllllMIIIIII II 
also LLANQK I TARQSDKDVRHIE I DLSGSDLHYLPGDALGVWFDNDPALVRE I LDLLGIDQAT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml50 . pep EIQAGGKMMPVARALSSHFELTQNTPAFVKGYAAFAHYEELDKIIADNAVLQDFVQNTPr 

IMMil :lll II Itlllllll :|||:| ||||||| |||:||| 

al50 EIQAGGKTLPVASALLSHFELTQNTPAFVKGYAPFADDDELDRIAADNAVLQGFVQSTPr 
310 320 330 340 350 360 

370 380 390 400 410 420 

ml 50. pep VDVLHRFPASLTAEQFIRLLRPLAPRLYSISSAQAEVGDEVHLTVGVVRFEHEGRARTGG 
'IIIIIIIIHIttll III! II III! tllhllltllll 11111:11 MM I ||t:|| 
also ADVLHRFPAKLTAEQFAGLLRPLAPRLYSISSSQAEVGDEVHLTVGAVRFEHEGRARAGG 

370 380 390 400 410 420 

430 440 450 460 470 480 

mlSO.pep ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE 

I I I I I I M I I I M M I I I I I I I i 1 1 I I I I |] II I I I I j t II I I II I I II I II t I I II I I I 

also ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE 
430 440 450 460 470 480 

490 500 SIO 520 530 540 

ml50 . pep GKNWLIFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
lllltMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIII 
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also GKNWLFFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
490 500 510 520 530 540 

550 560 570 580 590 600 

mlSO.pep QWLQEGAHIYVCGDAAKMAKDVEAALLDVIXGAGHLDEEGAEEYLDMLREEKRYQRDVYX 
I i I I I I I I I t I I I I 1 I I I 1 I I t I I j I I I I I I I I I I I I I I I j I I I I I I I I I I t I I I I I I I I 
al 5 0 QWLQEGAHI YVCGDAAKMAKDVEAALLDVI IGAGHLDEEGAEEYLDMLREEKRYQRDVYX 

550 560 570 580 590 600 



The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 613>: 

gl51 .seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCACACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTG 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TGGAGCGCGT TTTGGGGATG GTGGATTGCG TCGTCTTGTT GGTGGACGCA 

301 CAGGAAGGTC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAACCGTCCG 

401 CCCGTCCGAG CTGGGTTATC GACCAGACTT TCGAGTTGTT CGACAACTTG 

451 GGTGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGTTT 

501 GAGCGGCTTT GCCAAGCTGG AAGAAAccga CGAGAGCAGC GATATGCGCC 

551 CGCtgttcgA CACCATCCTA AAATACAcgc CtgCACCGAG CGGCAGCGCG 

601 GACGAGCCGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 

701 AAACCGTTGC CGTGATGAAC CACGAGCAGC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGCTTGGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATTGTGA TTATTTCCGG TATCGAAGAC ATCGGCATCG 

851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTAAACA CCAGCCCGCT 

951 CGCAGGTACA GAAGGCAAAT TCGTGACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTTGCCC TGCGCGTGGA AGACACCGCC 

1051 GatgCCGACG TGTTCCGCGT ATCcgGGCGC GGCGAACTGC ACCTGACGAT 

1101 TTTGCTGGAA AATATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAGC 

1151 CGCGCGTCGT GTACCGAGAC ATCGACGGTC AAAAATGCGA ACCTTATGAA 

1201 AACCTGACTG TGGACGTACc cgacgacAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACCAA TATGGAAAGC GACGGCAACG 

1301 GacgCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGTTTC 

1351 CAAGGCGAAT TCATGACCCT GACGCGCGGC GTCGGGCTGA TGAgccacGT 

1401 GTTcgacgac tacgcgcccg tcaAACCCGA TATGCCCGGC CGCCACAACG 

1451 GCGTactggt GtcccaAGAG CAGGGCGAGG CGGTTGCTTA CGCCTTGTGG 

1501 AATCTTGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGTATG ATTATCGGCA TCCACAGCCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAAAA CTCACCAATA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCGGTGC GCCTGACCAC GCCGATCAAA CTGAcgcTGG AAGGCGCGGT 

1701 CGAGTTTATC GACGATGACG AGCTGGTGGA AATCACGCCG CAAtCCatCC 

1751 gcctgcgcat gcgttacctG AGCGaattgg aacgccgccg tcaTTTTAAA 

1801 AagctgGATT AA 

This corresponds to the amino acid sequence <SEQ ID 614; ORF 15Lng>: 

glSl.pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGCHIN IVDTPGHADF GGE VERVI^ VDCWLLVDA 

101 QEGPMPQTRF VTKKALAIiGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESS DMRPLFDTIL KYTPAPSGSA 

201 DEPLQIiQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HEQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVII5GIED IGIGVTITD K DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 
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501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRMRYL SELERRRHFK 

601 KIiD* 

The following partial DNA sequence was identified in Kmeningitidis <SEQ ID 61 5>: 

mlSl.seq 

1 ATGAAACAAA TCC6CAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAQAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGCGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

301 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCXJAC AAGCCGTCCG 

401 CTCGTCCGAG CTGGGTTATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

451 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGGTT 

501 GAGCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 

551 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 

601 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 

701 AAACCGTTGC CGTCATGAAC CACGATCAGC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGTTTGGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATCGTGA TTATTTCCX3G TATCGAAGAC ATCGGTATCG 

851 GCGTAACCAT CACCGACAAA GACAATCCCA AAGGCCTACC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGCT 

951 GGCGGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCX3AGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGATGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

1301 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGTTTC 

1351 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

1401 GTTCGACGAT TACGCX3CCCG TCAAACCCGA TATGCCCGGC CGCCACAACG 

1451 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 

1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGCATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCCGTTC GCCTGACCAC GCCAATCAAG CTGACGCTGG AAGGTGCGGT 

1701 TGAGTTTATC GACGATGACG AACTCGTTGA AATCACGCCG CAATCCATCC 

1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCACTTTAAA 

1801 AAGCTGGATT GA 

This corresponds to the amino acid sequence <SEQ ID 616; ORF 151>: 

mis 1 .pep 

1 MKQIRNIAII AHVDHGKTTb VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEOLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HDQQIAQGRI 

251 NQLIiGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 

501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK 

601 KLD* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from ^gonorrhoeae 

ORF 151 shows 99.2% identity over a 603 aa overlap with a predicted ORF (ORF ISl.ng) 
from N, gonorrhoeae: 
mlSl/glSl 

10 20 30 40 50 60 

ml51 . pep MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 
I 1 M I I I I M I I I I I M It I I I I I I I I I I I I I I I t I I ) I I I I I I I I I I I I I I I t I I I I I I 
gl51 MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 

10 20 30 40 50 60 

70 80 90 100 110 120 

itllSl . pep AIDYEGYHINIVDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 
Mini I M I I I I I I I i. t I I i I I I I I I I I M i I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl51 AIDYEGCHINIVDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

mlSl.pep KPIWINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 
tMIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIMMIIIIIIIIMI: 
gl51 KPIWINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml51 . pep DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 

I I M I I I I I Ill t II I t I I I II I I I I II I II I I I I I I I I II I 11 

gl51 DMRPLFDTILKYTPAPSGSADEPLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 
190 200 210 220 230 240 

250 260 270 280 290 300 

mlSl.pep HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 

■•lllllllllllllllllllllllllltll I II I I I I I II I I II II I II II II t 

gl51 HEQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml51 . pep VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 

I M M I M f I I M I I I I I II Ill I I I I I M I I I I I t I I I I I I I I I 

gl51 VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml51 . pep GELHLTILLENMRREGYELAVGKPRWYRDIDGQKCEPYENLTVDVPDDNQGAVMEELGR 

I I M I I I I I II I I I I I I I I II lilt II II Mill I I I II I III I I III 

gl51 GELHLTILLENMRREGYELAVGKPRWYRDIDGQKCEPYENLTVDVPDDNQGAVMEELGR 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml51 . pep RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 
lllllMIMIIItllllllllllllllllltllllllllllllllllllllllllllll 
gl51 RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 

430 440 450 460 470 480 

490 500 510 520 530 540 

mlSl.pep RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIHSRDNDLWNPLKGKK 
I > I i I I I M I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M t I 
gl51 RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIHSRDNDLWNPLKGKK 

490 500 510 520 530 540 

550 560 570 580 590 600 

mlSl.pep LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 

I I I M I I I I I I I I I I I I I t Illlllllllllllllllllll Mini I 

gl51 LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRMRYLSELERRRHFK 

550 560 570 580 590 600 
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ml 51. pep KLDX 
I I I I 

gl51 KLDX 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 617>: 

alSl . seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGAGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

301 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA TAAAATCGAC AAACCGTCCG 

401 CCCGTCCGAG CTGGGTCATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

451 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTATG CTTCCGGTCT 

501 GTCCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 

551 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 

601 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGTATC AAGCCCGGTC 

701 AAGTTGTTGC CGTCATGAAC CACGATCAAC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGTTTAGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATTGAAGAC ATCGGCATCG 

851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGTT 

951 GGCAGGTACG GAAGGCAAAT TCGT7WVCCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGACGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

1301 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGCTTC 

1351 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

1401 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCTGGC CGCCACAACG 

1451 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 

1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGTATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAA7VA CTTACCAACA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCCGTTC GCCTGACCAC GCCGATTAAG CTGACGCTGG AAGGTGCGGT 

1701 CGAGTTTATC GACGATGATG AGCTGGTAGA AATCACGCCG CAATCCATCC 

1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCATTTCAAA 

1801 AAGCTAGATT GA 

This corresponds to the amino acid sequence <SEQ ID 618; ORF 15La>: 

al51.pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQWAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRVVYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 

501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

" 551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK 

601 KLD* 

mlSl/al51 99.8% identity in 603 aa overlap 

10 20 30 40 50 60 
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ml51 . pep MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 
lllllllltlllllllltlllllllillllltlllllllllllllllillllllllllll 
al51 MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 
10 20 30 40 50 60 

"70 80 90 100 110 120 

ml51 . pep AIDYEGYHINIVDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 
IIIIIMIIIIIIIIIIllllllllMIIIIIIIIIIMllllillllinillllllll 
al51 AIDYEGYHINIVDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPC3TRFVTKKALALGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 5 1 . pep KPI WINKI DKPSARPSWVI DQTFELFDNLGATDEQLDFPI VYASGLSGFAKLEETDESN 

lllllllllllllllllllllllllllllllllllllllllllllllllllllltlllll 
al51 KPIWINKI DKPSARPSWVI DQTFELFDNLGATDEQLDFPI VYASGLSGFAKLEETDESN 

130 140 150 160 170 180 

190 200 ■ 210 220 230 240 

ml 51. pep DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 
llllllllilllll]||||||||||||illllllllllllllitllllllllli:||||| 
a 1 5 1 DMRPLFDTILK YT PAPSGSADETLQLQI SQLDYDNYTGRLGIGRILNGRIKPGQVVAVMN 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 51. pep HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 
lltillllllllllllllilllllllllllllllllllllllltllllllllllllllll 
al51 HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 5 1 . pep VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 
I I I I I I I I I I t I I I I I I I I I I t I I I I I I i I I I I I I I I I I I I I I I I I I I I i I I I I I I I j I I 
al51 VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 51 , pep GELHLTILLENMRREGYELAVGKPRVVYRDIDGQKCEPYENLTVDVPDDNQGAVMEELGR 

lllMMIIIIIIIIIMMMIillltlllMllltllllilllllMIIIMIIIIII 
a 1 5 1 GELHLTILLENMRREGYELAVGKPRVVYRDI DGQKCEP YENLTVDVPDDNQGAVMEELGR 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml51 . pep RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 
IIIMIIIIIIIIIIIIIIIIIIIllllllllllllllltlllltlllllllllllllll 
alSl RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml51 . pep RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIHSRDNDLVVNPLKGKK 

llllllllllillllllitllllllltlllllllllltlllllllllllllllllllllt 

a 1 5 1 RHNGVLVSQEQGEAVAYALWNLEDRGRMFVS PNDKI YEGMIIGIHSRDNDLVVNPLKGKK 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 51. pep LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 

MIIIIMIIIIIIIIIMIIIIIIilllllllllllllllllllllllliillllllM 

al51 LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 
550 560 570 580 590 600 



ml 51. pep KLDX 
I I I I 

al51 KLDX 



The following partial DNA sequence was identified in Kgonorrhoeae <SEQ K) 619>: 
gl52.seq 

1 ATGAAAAaca aAACCaaagt ctgGGacttc cCcacccgcc ttTTCCactG 
51 GctgcttgCC gCATCCctgc CCTTTATGTG gtatagCGCA AAAGCCGGCG 
101 GcgataTGCT GcaatgGCAC ACGCGCGTCG GGCTGCTCGT CCTTTTCCTG 
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151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAgcgATA CCGCCCGTTT 

201 CTCccgTtTC GTCCGAGGTT GGGCAGGTAT ACGCGGCTAT CTGAAAAAcg 

251 gCATTCCCGA ACAtatcCAiS CCCGGACACA ACCCCTTGGG CGCACTgatg 

301 gtcGTTGCGC TTTTGgccgc cgtcTCATTT CAagtcggcA CGGGGCTTTT 

351 Tgccgccaat gaaaacacct tcagcaCCAa cggctacctc aaccatttgg 

401 tttccgaaca tacgGGCAGC CTTATACGGA AAATCCACCT CAACTTTTTC 

451 AAGCTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGCCG TCGCCGCATA 

501 CCGCATATTC AAAAAGAAAA ACCTCGTCCG CCCGATGATA ACCGGCTTCA 

551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCCGGCAA AGCCGCGCTT 

601 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 

651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ED 620; ORF 152.ng>: 

gl52 .pep 

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TR VGLLVLFL 

51 LVFRLCWGI W GSDTARFSRF VRGWAGIRGY LKNGIPEHIQ PGHNP LGALM 

101 WALIiAAVSF QV GTGLFAAN ENTFSTNGYL NHLVSEHTGS LIRKIHLNFF 

151 KLLAVFSAVH lAAVA AYRIF KKKNLVRPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVAALA AAAILLL S* 

The following partial DNA sequence was identified in Kmeningitidis <SEQ ID 621>: 



ml52 , seq 












1 


ATGAAAAACA 


AAACCAAAGT 


CTGGGACCTC 


CCCACCCGCC 


TTTTCCACTG 


51 


GCTGCTTGCC 


GCGTCCCTGC 


CCTTTATGTG 


GTATAGCGCG 


AAAGCCGGCG 


101 


GCGATATGCT 


GCAATGGCAC 


ACGCGCGTCG 


GGCTGTTCGT 


CCTTTTCCTG 


151 


CTCGTATTTC 


GCCTCTGCTG 


GGGCATTTGG 


GGCAGCGATA 


CCGCCCGTTT 


201 


TTCCCGTTTC 


GTCCAAGGCT 


GGGCAGGCAT 


ACGCGGCTAT 


CTGAAAAACG 


251 


GTATTCCCGA 


ACACATCCAG 


CCCGGACACA 


ACCCCTTGGG 


CGCACTGATG 


301 


GTCGTTGCGC 


TTTTGGCCGC 


CGTGTCCTTC 


CAAGTCGGCA 


CCGGGCTTTT 


351 


TGCCGCCGAT 


GAAAACACCT 


TCAGCACCAA 


CGGCTACCTC 


AACCATTTGG 


401 


TTTCCGAACA 


TACGGGCAGC 


CTTATGCGGA 


AAATCCACCT 


CAACTTTTTC 


451 


AAGCTGCTCG 


CCGTTTTTTC 


TGCAATCCAC 


ATCGCCGCCG 


TCGCCGCATA 


501 


CCGCGTATTC 


AAAAAGAAAA 


ACCTCATCCT 


CCCGATGATA 


ACCGGCTTCA 


551 


AATACATCGA 


AGGCAAAACC 


TCAATCCGCT 


TTGCAGGCAA 


AGCCGCGCTT 


601 


GCCGCCGCAT 


TATCGGTTGC 


CTCGCTTGCC 


GCAGCCGCCA 


TCCTGCTCCT 


651 


GTCCTGA 











This corresponds to the amino acid sequence <SEQ ID 622; ORF 152>: 

ml52 .pep 

1 MKNKTKVWDL PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH T RVGLFVLFL 

^1 LVFRLCWGI W GSDTARFSRF VQGWAGIRGY LKNGIPEHIQ PGHNP LGALM 

101 WALIiAAVSF QV GTGLFAAD ENTFSTNGYL NHLVSEHTGS LMRKIHLNFF 

151 KLLAVFSAIH lAAVA AYRVF KKKNLILPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVASLA AAAILLL S* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF firom N,^onorrhoeae 

ORP 152 shows 95.4% identity over a 218 aa overlap with apredicted ORF (ORF 152.ng) 
firom K gonorrhoeae: 

ml52/gl52 



ml52 ,pep 
gl52 



ml52 .pep 
gl52 

70 80 90 100 110 120 



10 20 30 40 50 60 

MKNKTKVWDLPTRLFHWLLAASLPFMWYSAKAGGDMLQWHTRVGLFVLFLLVFRLCWGIW 

IIIIIIIIIHIIIIIIIIIilllllllllllllllllilllllhllMIIIIIIIIII 

MKNKTKVWDFPTRLFHWLLAAS LP PMWYSAKAGGDMLQWHTRVGLLVLFLLVFRLCWGI W 
10 20 30 40 50 60 

70 80 90 100 110 120 

GSDTARFSRFVQGWAGIRGYLKNGIPEHIQPGHNPLGALMWALLAAVSFQVGTGLFAAD 

llllllllllhlllllllllllllllMllllllllllilllllllllllllllllll: 
GSDTARFSRFVRGWAGIRGYLKNGIPEHIQPGHNPLGALMWALLAAVSFQVGTGLFAAN 
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130 140 150 160 170 180 

ml52 . pep ENTFSTNGYl^LVSEHTGSLMRKIHLNFFKLIiAVFSAIHIAAVAAYRVFKKKNLI 

llllllllllllllllllllhlllllllllllllllhllllllllhllllll: III 
gl52 ENTFSTNGYLNHLVSEMTGSLIRKIHLNFFKLIiAVFSAVHIAAVAAYRIFKKKNLVRPMI 
130 140 150 160 170 180 



190 200 210 219 

ml52 .pep TGFKYIEGKTSIRFAGKAALAAALSVASIiAAAAILLLSX 

IIIIIIIIMIMIIIIIIMIIIIIhlllllllllll 

gl52 TGFKYIEGKTSIRFAGKAALAAALSVAALAAAAILLLSX 
190 200 210 

The following partial DNA sequence was identified in M meningitidis <SEQ E) 623>: 

al52 . seq 

1 ATGAAAAACA AAACCAAAGT CTGGGACTTC CCCACCCGCC TTTTCCACTG 

51 GCTGCTTGCC GCATCCCTAC CCTTTATGTG GTATAGCGCG AAAACCGGCG 

101 GCGATATGCT GCAATGGCAC ACGCGCGTCG GGCTGTTTAT CCTTTTCCTG 

151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAGCGATA CCGCCCGTTT 

201 CTCCCGTTTC GTCCGCGGAT GGTCGGGTAT CAGAGAGTAT ATGAAAAACG 

251 GTATTCCCGA ACACGTCCAA CCCGGACACA ACCCCTTGGG CGCACTGATG 

301 GTCGTTGCGC TTTTGGCCGC CGTGTCGTTC CAAGTCGGCA CAGGGCTTTT 

351 TGCCGCCGAT GTAAACACCT TCAGCACCAA CGGCTACCTC AACCATTTGG 

401 TTTCCGAACA TACGGGCAGC CTTATGCGGA AAATCCATCT CAACTTTTTC 

451 AAACTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGNCG TCGCCGCATA 

501 CCGCGTGTTC AAAAAGAAAA ACCTCGTCCT CCCGATGATA ACCGGCTTCA 

551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCCGGCAA AGCCGCGCTT 

601 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 

651 GTCCTGA 



This corresponds to the amino acid sequence <SEQ E) 624; ORF 152.a>: 

al52 .pep 

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KTGGDMLQWH TRVGLFILFL 

51 LVFRLCWGI W GSDTARFSRF VRGWSGIREY MKNGIPEHVQ PGHNP LGALM 

101 VVALLAAVSF QV GTGLFAAD VNTFSTNGYL NHLVSEHTGS LMRKIHLNFF 

151 KLLAVFSAVH lAXVAA YRVF KKKNLVLPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVAALA AAAILLLS* 



ml 52/al 52 94.0% identity in 2 1 8 aa overlap 

10 20 30 40 50 60 

ml52 . pep MKNKTKVWDLPTRLFHWLLAASLPFMWYSAKAGGDMLQWHTRVGLFVLFLLVFRLCWGIW 
I I I I I i I I I : I I I I I I i I I i I I I I j I I I I I 1 : I I I I I I I I i I I I I I : I I I I I I I I I I I I I 
al52 MKNKTKVWDFPTRLFHWLLAASLPFMWYSAKTGGDMLQWHTRVGLFILFLLVFRLCWGIW 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml52 . pep GSDTARFSRFVQGWAGIRGYLKNGIPEHIQPGHNPLGALMWALLAAVSFQVGTGLFAAD 
I I I t I I I M I I : M: I I I I : I I I I I I I : I I I I I I I I I I I I I I I I I I I I M I 1 I I I I I I I 
al52 GSDTARFSRFVRGWSGIREYMKNGIPEHVQPGHNPLGALMWALLAAVSFQVGTGLFAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

inl52 . pep ENTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAIHIAAVAAYRVFKKKNLILPMI 
I I ) I I I I I I i I I I I I I I I I I I I i I i ] I I I I I I t I I I I : 1 i I I I I I I I I I I I I I : I I I i 
al52 VNTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAVHIAXVAAYRVFKKKNLVLPMI 

130 140 150 160 170 180 



190 200 210 219 

ml52 . pep TGFKYIEGKTSIRFAGKAALAAALSVASLAAAAILLLSX 
i I I I I 1 t I I I M n I I I I I I I I I I I I I : I I I I I I I M I I 
al52 TGFKYIEGKTSIRFAGKAALAAALSVAALAAAAILLLSX 

190 200 210 
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The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 625>: 

gl53.8eq 



1 


atggggtttg 


cttaCAgtat 


gacgtatatc 


gaggtCGGGa 


taccggaggc 


51 


ggcatccgtc 


ctttCgctGC 


CCGAGATgat 


gcgcctgatG 


GTGTTtCagg 


101 


attATGGTTT 


TttggcCGAA 


GTGATGTTTG 


TGctgaCTTT 


cGGCGcgcCG 


151 


GTTCTGTTtC 


TGCTGCTGTG 


CCTGTATGTC 


TATGCCGCGC 


TGATACGGAA 


201 


ACAGGCGTAT 


CCTGCGCTGC 


GTTTGGCAAC 


GCGTGTGATG 


GTGCGCTTGA 


251 


GGCAGGCGAT 


GATGGTGGAT 


GTGTTTTTTG 


TTTCCACTCT 


GGTGGOGTAT 


301 


ATCAAGCTCT 


CGTCTGTGGC 


AAAGGTTCGC 


TTCGGGCCGG 


CGTTTTATCT 


351 


GATGTTCGCG 


CTGTCGGTTA 


TGCTGATTCG 


GACTTCGGTA 


TCGGTTCCCC 


401 


AGCATTGGGT 


GTATTTCCAA 


ATCGGGCGGC 


TGACGGGGAA 


TAATGCGGTT 


451 


CAGACGGCAT 


CGGAAGGCAA 


AACCTGTTGC 


AGCCGCTGCC 


TGTATTTccg 


501 


cgacAGTgcc 


gaatccCCCT 


GCGGGGTGTg 


cgGCGcggaA 


CTgtacggcg 


551 


gacggccgaa 


aagtCTGAGt 


atttCgtCGG 


CGTTTCTgac 


ggcggcggTT 


601 


GTTTTGTATT 


TCCctgCcaa 


TATCctgccg 


attaTGAttt 


cgtccAATCc 


651 


tgccgccacg 


GAGGcCAACA 


CCATCTTTAG 


CGGCATCGCT 


TATATGTGGG 


701 


ACGagggcgA 


CAGGCTGATT 


GCGGCGGTTA 


TTTTCAGCGC 


GAGTATTTTG 


751 


GTGCCGGTGC 


TGAAGATTGC 


GGCAATGTCG 


GTTTTGATTG 


CGGCGGCACG 


BOl 


GTTCGCTTTG 


CCGGCGGGCG 


CAAAGAAATT 


GTCGCACCTC 


tacCGCATCA 


851 


CCGAAGCGGT 


CGGCCGCTGG 


TCGATGATTG 


ATATTTTTGT 


GATTATTATT 


901 


TTGATGTGTT 


CGTTCCacaC 


TTATGCCGCG 


CGCGTCATTC 


CGGGCAGTGC 


951 


GGCAGTCTAT 


TTCTGCCTGG 


TCGTGATTTT 


GACGATGCTG 


TCCGCCTATT 


1001 


ATTTCGACCC 


GCGCCTGCTT 


TGGGACAAAC 


GCGCTTCAGA 


CGGCATTGCT 


1051 


TTCAACGAAA 


CGGAAAAATA 


TGACTGA 






This corresponds to the amino acid sequence <SEQ E) 626; ORF 153.ng>: 


gl53.pep 












1 


MGFAYSMTYI 


EVGIPEAASV 


LSLPEMMRLM VFQDYGFIAE 


VMFVLTFGAP 


51 


VLFLIiIiCLYV YAALIRKQAY 


PAIiRIiATRVM VRLRQAMMVD 


VFFVSTLVAY 


101 


IKLSSVAKVR 


FGPAFYLMFA 


LSVMLIRTSV SVPQHWVYPQ 


IGRLTGNNAV 


isi 


QTASEGKTCC 


SRCLYFRDSA 


ESPCGVCGAE 


LYGGRPKSLS 


ISSAFLTAAV 


201 


VLYFPANILP 


IMISSNPAAT 


BANTIFSGIA 


YMWDEGDRLI 


AAVIFSASIL 


251 


VPVLKIAAMS 


VLIAAARFAL 


PAGAKKLSHL 


YRITEAVGRW 


SMIDIFVIII 


301 


LMCSFHTYAA 


RVIPGSAAVY 


FCLWILTML 


SAYYFDPRLL 


WDKRASDGIA 


351 


FNETEKYD* 











The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 627>: 

ml53 .seq 

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATTATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACTTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

251 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGGCGTAT 

301 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGGCCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCAGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

401 AGCATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 

451 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 

501 CGACAGTGCC GAATCCCCCT GCGGCGTGTG CGGTGCGGAA CTGTACCGCC 

551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 

601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG ATTATGATTT CGTCCAATCC 

651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG 

701 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

751 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 

BOl CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 

851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT 

901 TTGATGTGTT CGTTCCACAC TTATGCCGCG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTCT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TTCAATGAAA CGGAAAAACA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 628; ORF 153>: 

ml53 .pep 

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

51 VLFLLLC LYV YAALIRKQAY PALRLATRVM VRLRQ AMMVD VFFVSTLVAY 
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101 IKLSSVAEVR FGP AFYLMFA LSVMLIRTSV SVPQHWVYFQ IGRLTGDNAV 

151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYRRRPKSLS ISS AFLTAAV 

201 ILYFPANILP I MISSNPAAT EVNTILNGIA YMWDEGDRLI AAVIFSASIL 

251 VPVLKIAAMS VLIAS ARFAL PTGAKKLSHL YRITE AVGRW SMIDIFVIII 

301 LMCSFHTYAA RVIPGSAAVY FCLWILT ML SAYYFDPRLL WDKRASDGIA 

351 PNETEKHD* 

inl53 / gl53 96.1% identity in 358 aa overlap 

10 20 30 40 50 60 

ml 5 3 . pep MAFAYGMTYIEVGI PGAASVLS LPEMMRLMVFQDYGFLAEVMPVLTFGAPVLFLLLCLYV 

hll|:|llllllll llllllllllllllllllllllllllllllllllllllllllll 
gl53 MGFAYSMTYIEVGIPEAASVLSLPEMMRIMVFQDYGFLAEVMFVLTFGAPVLFLIiLCLYV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 5 3 . pep YAALI RKQAYPALRLATRVMVRIiRQAMMVDVFFVSTLVAYI KLSS VAEVRFG PAFYLMFA 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIhllllllllllll 

glS3 YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAKVRFGPAFYLMFA 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 5 3 .pep LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCXSAE 
lllllllllllllllllllllllllhlllllllllllllllllllllllllllllllll 
gl53 LSVMLIRTSVSVPQHWVYFQIGRLTGNNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml53 .pep LYRRRPKSLSISSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI 
II lllllllllllllllhlllllllllll|||||ll||:||h:||||||||||||| 
gl53 LYGGRPKSLSISSAFLTAAWLYFPANILPIMISSNPAATEANTIFSGIAYMWDEGDRLI 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml53 . pep AAVIFSASILVPVLKIAAMSVLIASARFALPTGAKKIiSHLyRITEAVGRWSMIDIFVIII 

lllllllllllllllllllllllhlllllhllllllllllllllllllllllllllll 
gl53 AAVIFSASILVPVLKIAAMSVLIAAARFALPAGAKKLSHLYRITEAVGRWSMIDIFVIII 
250 260 270 280 290 300 

310 320 330 340 350 359 

ral53.pep LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIhll 

gl53 LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKYDX 
310 320 330 340 350 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 629>: 

al53 , seq 

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATTATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACCTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

251 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGGCGTAT 

301 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGATCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCGGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

4 01 AGCATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 

4 51 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 

501 CGACAGTGCC GAATCCCCCT GCGGCGTGTG CGGTGCGGAA CTGTACCGCC 

551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 

601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG ATTATGATTT CGTCCAATCC 

651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG 

701 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

751 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 

801 CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 
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851 CCGAAGCGGT CGGCCGCTGG 

901 TTGATGTGTT CGTTCCACAC 

951 GGCAGTCTAT TTCTGCCTGG 

1001 ATTTCGACCC GCGCCTGCTT 

1051 TTCAATGAAA CGGAAAAACA 



TCGATGATTG ATATTTTTGT GATTATTATT 
TTATGCCGCG CGCGTCATTC CGGGCAGTGC 
TCGTGATTCT GACGATGCTG TCCGCCTATT 
TGGGACAAAC GCGCTTCAGA CGGCATTGCT 
TGACTGA 



This corresponds to the amino acid sequence <SEQ ID 630; ORF 153.a>: 

al53 .pep 

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

51 VLFLLLCLYV YAALIRKQAY PALRLATRVM VRLRQ AMMVD VFFVSTLVAY 

101 IKLSSVAEVR FGS AFYLMFA LSVMLIRTSV SVPQHWVYFQ IGRLTGDNAV 

151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYRRRPKSLS ISSAFLTAAV 

201 ILYFPANILP I MISSNPAAT EVNTILNGIA YMWDE6DRL I AAVIFSASIL 

251 VPVLKIAAMS VLIASARFAL PTGAKKLSHL YRITE AVGRW SMIDIFVIII 

301 LMCSFHTYAA R VIPGSAAVY FCLWILT ML SAYYFDPRLL WDKRASDGIA 

351 FNETEKHD* 



ml53/al53 99.7% identity in 358 aa overlap 

10 20 30 40 50 60 

ml53 . pep MAFAYGMTYI EVG I PGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 

I I I I I 1 I I I I I I I I I I I t I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a 1 5 3 MAFAYGMTYIEVGI PGAASVLSLPEMMRLMVFQDYGFLAEVMFVLT FGAPVLFLLLCLYV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml53 . pep YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGPAFYLMFA 
I I I I I M I I I t I t I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I M I I I I I I I I 
al53 YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGSAFYLMFA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml53 . pep LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 
t I I I I I I M I I I I I I I I I I M I M I I I M I I I I I I I I I I I I I i I I I I I I I I I I I ] I I I I I 
al53 LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 
130 140 150 160 170 180 



190 200 210 220 230 240 

ml53 . pep LYRRRPKSLS I SSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al53 LYRRRPKSLSISSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml53.pep AAVIFSASILVPVLKIAAMSVLIASARFALPTGAKKLSHLYRITEAVGRWSMIDIFVIII 
MIIIMIIIIMMMMIIItllllllllillllllllltMlllllllllllllllt 
al 5 3 AAVI FSAS ILVPVLKI AT^S VLI AS ARFALPTGAKKLSHLYRITEAVGRWSMI DI FVI 1 1 

250 260 270 280 290 300 



310 320 330 340 350 359 

ml53 . pep LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDECRASDGIAFNETEKHDX 

1 1 1 1 ) 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 n I I I I I I I I t I I I I I I I I I Ml I I I I I I I I I I I I I I I 

al53 LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 
310 320 330 340 350 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 63 1 >: 

gl54.seq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCTCAAG CACGCGTCCG 

51 CAAAAACAAC accttcctCT CCGCCGTCTG GCTGGTCCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATCGAAG TCAACAATAC 

201 GGTCATTAAG GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTTA CTGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGcgtAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 
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401 


ACATCGCTTT 


TACACCCGGC 


451 


GTGCAGGACA 


TTCCGCCCGT 


501 


GAATTTGATT 


GGTAAAAACG 


551 


TGTATGAAAA 


CTTTATGGTC 


601 


TCCGACCAAA 


GCGTGCATTA 


651 


ACTGATTCAT 


TCCGCCAGCC 


701 


AAACCACAGG 


CAGCGGCATC 


751 


CTGTCAGGCG 


CGATTTCATT 


801 


CGTCAAAAGC 


GAGGACAGCT 


851 


CCAACCTGCC 


TGACGACCGC 


901 


TCCGTGCGCG 


GACTGACCGT 


951 


TGTcggCATG 


GTTTCCX3ATG 


1001 


ACCtgtTTGA 


aaacggctgg 


1051 


CGTTTGGAAA 


TCAATGCCGA 


1101 


ATTCCAGACG 


GCCTTAAACA 


1151 


ACCTGCTGAC 


CGGCGGCAAA 


1201 


TCGCCCAAGC 


TGCGACCGCA 


1251 


CACACGGGGC 


GGCGGTTTGG 


1301 


TGGACaaatT 


CAACAATCTG 


1351 


GGCTCGCTCG 


CCGAACTCAA 


1401 


AAGCTCCATT 


GacaAACTGG 


1451 


ACGAACTGAA 


CCAAACTCTG 


1501 


TCGcctCAAT 


CGCCTATCTa 


1551 


GGACAAAACC 


TTAAAagacg 


1601 


aacCCaaCgc 


actGATTTtc 


1651 


GGAAGCCGAT 


AA 



AAAAGCGGCG AGGCAAAAGA CGTGTTCCAA 
TACCGCCATC GGGCAAAgcg GGCTGCGCTT 
AccgCATCCT CAACGTcaaC AGCCCTGTTT 
GGGCAAATCG AAAGCGCGCA TTTCGAcccG 
CACCATCTTC ATCCAAAGCC CCAACGACAA 
GTTTTTGGCT GGAAAGCGGC ATCAATATCG 
AAACTCAATT CCGCCCCTCT GCCTGCCCTG 
TGATTCGCCG AAAACCAAAA ACAGTAAAAA 
TCACGCTTTA CGACAGCCGC AGCGAAATCG 
TCGCTGTACT ACACCGCGTT TTTCAAACAA 
cggTTCGCCT GTcgaATACA AAGGGCtgaA 
TCCCTTATTT TGACCGCAAt gacagCCTGC 
aTTcccGtac gCATCCGCAT cgagccTTCC 
CGAGCAAAGC AAAGAGCATT GGAAACAACA 
AAGGCCTGAC CGCCACCATC TCCAGCAACA 
ATGATTGAGT TGAACGATCA GCCTTCCGCC 
TACCGTTTAT GCAGGCGATA CCGTCATCGC 
ATGACTTGCA GGTCAAATTG GCGGATTTGC 
CCATTggata aAACCGTTGC CGAATTGAAC 
GTCCGCACTC AAATCCGCCA ATGCCGCCCT 
TCGgcaaTCC GCAGACGCAA AACATCCCGA 
AAAGAGTTGC GCATAACCCT GCAAGGCGTA 
cgGAgacgta caAAATAcgc tgCaAAGTTT 
TtcaACCCGT CATTAACACT TTGAaAGAAa 
aacaACAGCA GCAAAGAccc tATCCCGAAA 



This corresponds to the amino acid sequence <SEQ ID 632; ORF 154.ng>: 

gl54.pep 

1 MTDNSPPPNG HAQARVRKNN TFLSAVWLVP LIALIAGGW L WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSGEAKDVFQ 

151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQIESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL 

251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEIANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGM VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 

351 RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGGK MIELNDQPSA 

401 SPKLRPHTVY AGDTVIATRG GGLDDLQVKL ADLLDKFNNL PLDKTVAELN 

451 GSIiAELKSAL KSANAALSSI DKLVGNPQTQ NIPNELNQTL KELRXTLQCV 

501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NNSSKDPIPK 

551 GSR* 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 633>: 

ml54.aeq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CRCGCCCAAG CACGCGTCCG 

51 CAAAAACAAC ACCTTCCTCT CTGCCGTCTG GCTGGTTCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATTGAGG TCAACAATAC 

201 GGTCATCAAA GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTAA CCGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGCGTAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 

401 ACATCGCCTT TACACCCGGC AAAAGCGACG AGGCAAAAGA CGTGTTCCAA 

451 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAGCG GGCTGCGCTT 

501 GAATTTGATT GGTAAAAACG ACCGCATCCT CAACGTCAAC AGCCCTGTTT 

551 TGTATGAAAA TTTTATGGTC GGGCAAGTCG AAAGCGCGCA TTTCGACCCG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC CCAACGACAA 

651 ACTGATTCAT TCCGCCAGCC GTTTTTGGCT GGAAAGCGGC ATCAATATCG 

701 AAACCACAGG CAGCGGCATC AAACTCAATT CCGCCCCTCT GCCTGCCCTG 

751 CTGTCGGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 

801 CGTCAAAAGC GAAGACAGCT TCACGCTTTA CGACAGCCGC AGCGAAGTCG 

851 CCAACCTGCC TGACGACCGC TCGCTGTACT ACACCGCGTT TTTCAAACAA 

901 TCCGTGCGCG GCCTGACCGT CGGTTCGCCC GTCGAGTACA AAGGGCTGAA 

951 TGTCGGCGTG GTTTCCGACG TTCCTTATTT CGACCGCAAC GACAGCCTGC 

1001 ACCTGTTTGA AAACGGCTGG ATACCCGTAC GCATCCGCAT TGAACCTTCC 

1051 CGTTTGGAAA TCAATGCCGA CGAACAAAGC AAAGAACATT GGAAACAACA 

1101 ATTTCAGACG GCCTTAAACA AAGGCCTGAC CGCCACCATC TCCAGCAACA 

1151 ACCTGCTGAC CGGAAGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCA 
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1201 TCACCTAAGC TGCGACCX3CA TACCGTTTAT GCAGGCGATA CCGTTATCGC 

1251 GACCCAGGGC GGCGGTTTGG ACGATTTGCA GGTCAAATTG GCGGATTTGC 

1301 TGGACAAGTT CGACAAACTG CCTTTAGATA AGACGGTTGC CGAATTGAAC 

1351 GGTTCGCTTG CCGAGCTCAA ATCCACACTC AAATCTGCCA ATGCOGCCCT 

1401 AAGCTCCATC GACAAACTGG TCGGCAAACC GCAGACACAA AACATTCCGA 

1451 ACGAACTGAA CCAAACCCTG AAAGAGTTGC GCACAACCCT GCAAGGCGTA 

1501 TCGCCGCAAT CGCCTATCTA CGGCGACGTA CAAAATACGC TGCAAAGTTT 

1551 GGACAAAACT TTAAAAGACG TTCAACCCGT GATTAATACT TTGAAAGAAA 

1601 AACCCAACGC GCTGATTTTC AACAGCAGCA GCAAAGACCC TATCCCGAAA 

1651 GGAAGCCGAT AA 

This corresponds to the amino acid sequence <SEQ ID 634 ORF 154.a>: 

ml54.pep 

1 MTDNSPPPNG HAQARVRKNN TFLSAVWLVP LIALIAGGW L WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLIi SGSYIAFTPG KSDEAKDVFQ 

151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFKV GQVESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTCSGI KLNSAPLPAL 

251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLPENGW IPVRIRIEPS 

351 RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA 

401 SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN 

451 GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV 

501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK 

551 GSR* 



xnl54 / gl54 97.8% identity in 553 aa overlap 



10 20 30 40 50 60 

ml54 .pep MTDNSPPPNGHAQARVRKOTTTFLSAVWLVPLIALIAGGWLWVKEIRNRGPVVTLLMDSAE 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

gl54 MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPVVTLLMDSAE 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml54 .pep GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVBVTAQLNADVSGLIRSDTQFWVVKPRIDQ 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

gl 54 GI EVNNTVIKVLS IDVGRVTRI KLRDDQKGVEVTAQLNADVSGLI RSDTQFWWKPRIDQ 

70 80 90 100 110 120 



130 140 150 160 170 180 

inl54 .pep SGVTGIiGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNIilGKNDRIIiNVN 
llllllllllllllllllllll lllllllllllllllllllllllllllllllllllll 
gl54 SGVTGLGTLLSGSYIAFTPGKSGEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 54 -pep SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI 

lllllllllllhlllllllllllllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

gl54 SPVLYENFMVGQIESAHFDPSDQSVHYTIFIQSPNDKLIHSASRPWLESGINIETTGSGI 
190 200 210 220 230 240 



250 260 270 280 290 300 

ml54 .pep KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

llllillllllllllllllllllllllllllllllllllllNIIIIIIIIIIIIIIIII 

gl54 KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEIANLPDDRSLYYTAFFKQ 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml 5 4 . pep SVRGLTVGS PVEYKGLNVGWSDVPYFDRNDSLHLFENGWI PVR I RIEPSRLE INADEQS 

lllilllllllllllllllUllilllllllllllllllllllllllllllllllMIII 

gl54 SVRGLTVGSPVEYKGIiNVGMVSDVPYFDRNDSLHLFENGWI PVRIRIEPSRLEINADEQS 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 54 . pep KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 

gl 54 KEHWKQQFQTALNKGLTATISSNNLLTGGiCMIELNDQPSASPKLRPHTVYAGDTVIATRG 
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370 380 390 400 410 420 

430 440 450 460 470 480 

GGIiDDl/IVKIiADIiLDKroKLPIJ3KTVAELNGSLAELKSTLKSANAALSSIDKLVGKPQTQ 
llllllllllllllllh:||||llllllllllllll|:||lllllllll|||||:|||| 
GGIJ3DI/2VKUU}LI^KFKNLPLDKTVAELNGSLABLKSALKSANAALSSIDK^ 

430 440 450 460 470 480 

490 500 510 520 530 540 

NIPNEIJICyrLKBLRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 

IIIIIIIIIMIII IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 

NIPNELNQTLKELRITLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 
490 500 510 520 530 540 

550 

NSSSKDPIPKGSRX 

hlllllllllMI 

NNSSKDPIPKGSRX 
550 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 635>: 

al54 . seq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCCCAAG CACGCGTCCG 

51 CAAAAACAAC ACCTTCCTCT CTGCCGTCTG GCTGGTTCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATTGAGG TCAACAATAC 

201 GGTCATCAAA GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTAA CCGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGCGTAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 

401 ACATCGCCTT TACACCCGGC AAAAGCGACG AGGCAAAAGA CGTGTTCCAA 

451 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAGCG GGCTGCGCTT 

501 GAATTTGATT GGTAAAAACG ACCGCATCCT CAACGTCAAC AGCCCTGTTT 

551 TGTATGAAAA CTTTATGGTC GGGCAAGTCG AAAGCGCGCA TTTCGACCCG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC CCAACGACAA 

651 ACTGATTCAT TCCGCCAGCC GTTTCTGGCT GGAAAGCGGC ATCAATATCG 

701 AAACCACAGG CAGCGGCATC AAACTCAATT CCGCCCCTCT GCCTGCCCTG 

751 CTGTCGGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 

801 CGTCAAAAGC GAAGACAGCT TCACGCTTTA CGACAGCCGC AGCGAAGTCG 

851 CCAACCTGCC TGATGACCGT TCGCTGTACT ACACCGCGTT TTTCAAACAA 

901 TCCGTGCGCG GACTGACCGT CGGTTCGCCT GTCGAGTACA AAGGGCTGAA 

951 TGTCGGCGTG GTTTCCGATG TTCCTTATTT CGACCGCAAC GACAGCCTGC 

1001 ACCTGTTTGA AAACGGCTGG ATTCCCGTAC GCATCCGTAT TGAGCCTTCC 

1051 CGTTTGGAAA TCAATGCCGA CGAACAAAGC AAAGAACATT GGAAACAACA 

1101 ATTTCAGACG GCCTTAAACA AAGGCCTGAC CGCCACCATC TCCAGCAACA 

1151 ACCTGCTGAC CGGCAGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCC 

1201 TCGCCCAAGC TGCGACCGCA TACCGTTTAT GCAGGCGATA CCGTTATCGC 

1251 GACCCAGGGC GGCGGTTTGG ACGATTTGCA GGTCAAATTG GCGGATTTGC 

1301 TGGACAAGTT CGACAAACTG CCTTTAGATA AGACGGTTGC CGAATTGAAC 

1351 GGTTCGCTTG CCGAGCTCAA ATCCACACTC AAATCTGCCA ATGCCGCCCT 

1401 AAGCTCCATC GACAAACTGG TCGGCAAACC GCAGACACAA AACATTCCGA 

1451 ACGAACTGAA CCAAACCCTG AAAGAGTTGC GCACAACCCT GCAAGGCGTA 

1501 TCGCCTCAAT CGCCTATCTA CGGCGACGTA CAAAATACGC TGCAAAGTTT 

1551 GGACAAAACC TTAAAAGACG TTCAACCCGT CATTAACACT TTGAAAGAAA 

1601 AACCCAACGC GCTGATTTTC AACAGCAGCA GCAAAGACCC TATCCCGAAA 

1651 GGAAGCCGAT AA 

This corresponds to the amino acid sequence <SEQ ID 636; ORF 154.a>: 

al54 .pep 

1 MTDNSPPPNG HAQARVRKNN T FLSAVWLVP LIALIAGG WL WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSDEAKDVFQ 



ml54 .pep 
gl54 

ml54 .pep 
gl54 

ml54 .pep 
gl54 
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151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQVESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL 

251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 

351 RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA 

401 SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN 

451 GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV 

501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK 

551 GSR* 

ml 54/al 54 1 00.0% identity in 553 aa overlap 

10 20 30 40 50 60 

ml54 . pep MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPWTLLMDSAE 
I I I I I t I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I M I f I I I I I 
al54 MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPWTLLMDSAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml54 . pep GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWWKPRIDQ 
M I I I I I [ I M t I I I I t I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I )! I I I I I I I I I I I 
al54 GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWWKPRIDQ 

'?0 80 90 100 110 120 

130 140 150 160 170 180 

ml54 . pep SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 
I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I i I I I I I I I I I I I t I I I I I I I I I 
al54 SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml54.pep SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI 
I I I f I M I I M M I I I I I I I I I I I I I I I I I i I I I I I I i I I I I I I I M I I I 1 I I I I I I I I I 
al54 SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml54.pep KLNSAPLPALLSGAISFDSPKTICNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

I I I I I I I I I I I I I I I I I i I M I I I I I I I I I I I I I I I t I I I I I I i I I I I I I I I I I 

al54 KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml54.pep SVRGLTVGSPVEYKGLNVGWSDVPYFDRN DSLHLFENGW I PVR IRIEPSRLEINADEQS 

IIIMItMllllllllllllllllllllllll] Illllllllltillttlllll 

al54 SVRGLTVGSPVEYKGLNVGWSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml54 , pep KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 
IIIMIIIIMIIIIIIIillillllilMlilllllllllllllllltlllllllllll 
al54 KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 

370 380 390 400 410 420 

430 440 450 460 470 480 

inl54 . pep GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSIAELKSTLKSANAALS SI DKLVGKPQTQ 

I I I I M M I M I I I M I I I I I I I I I I I M I I I I I I I I I I i I I I M I I I I I I I I I I I I I I I 
al54 GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKSANAALSSIDKLVGKPQTQ 

430 440 450 460 470 480 

490 500 510 520 530 540 

ral54 . pep NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 

tllllllilllllM lillllilllllllf lllllllf IIIIIIIIMtl 

al54 NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 

490 500 510 520 530 540 
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550 

ml54 .pep NSSSKDPIPKGSRX 
I I I I I I I I I I I I I I 
al54 NSSSKDPIPKGSRX 
550 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 637>: 

gl55 . seq 

1 atCAAaatcg GtatcCCACG CGAGTCAtta tcCGGCGAAA cccgcgtagc 

51 ctgcAcgccc gCCACCGTTG CCctgctggg caAactAGGC TTTGAAACCG 

101 TTGtcgaAAG CGGTGCAggt TTGGCGGCAA GTTTggaCGA TGCCGCTTAC 

151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGCCTGCCC 

201 TTTAATTTAT AAGGTCAACG CGCCGTCCGA AGGCGAGCTG CCGCTGCTCA 

251 AAGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 

301 TTGGTCGAGG CCTTGCGCGC CAAGAAAGTC AACGCGCTGG CGATGGACAT 

351 GGTTCCCCGC ATTTCCCGCG CTCAGGCCTT GGACGCTTTG TCTTCAATGG 

4 01 CAAACATCAG CGGCTACCGC GCCGTGATTG AAGCCGCCAA CGCCTTCGGC 

4 51 CGTTTCTTCA CCGGTCAAAT CACTGCCGCC GGCAAAGTGC CGCCTGCGCA 

501 GGTTTTGGTG ATTGGCGCCG GTGTGGCGGG TTTGGCGGCA ATCGGTACGG 

551 CAAATTCGCT CGGCGCAGTG GTGCGCGCGT TCGATACCCG CTTGGAAGTG 

601 GCGGAACAAA TCGAATCGAT GGGCGGTAAG TTcctGAAAC TCGACTTCCT 

651 GCAAGAATCG GGCGGCAGCG GAGACGgctA CGCCAAAGTG ATGAGCGACG 

701 AATTTATCGC CGCCGAAATG AAGCTCTTTG CCGAACAGGC GAAAGAAGTG 

751 GACATCATCA TCACCACCGC CGCCATTCCG GGCAAACCCG CTCCCAAGCT 

801 GATTACCAAA GAAATGGTGG AAAGCATGAA ATCCGGATCC GTCATCGTCG 

851 ATTTGGCGGC GACGGGCGGC AACTGCGAAC TCACCCGACC GGGCGAATTG 

901 TCCGTAACCG GCAACGGCGT GAAAATCATC GGCTACACCG ACATGGCAAA 

951 CCGCCTTGCC GGACAGTCTT CCCAGCTTTA CGCCACCAAC TTGGTGAACC 

1001 TGACCAAGCT GTTAAGCCCG AACAAAGAcg gcgaAATCAC GCTGGACTTC 

1051 GAAGacgtGA TTATCCGCAA TATGACCGTT ACCCGcgacg gcgaaATCAC 

1101 CTTCCCGCCT CCGccgaTTc aggtTTCcgc ccggccgCAG CAAAcgccgt 

1151 ctgaAAAagc cgcGCCTGCC GCCAagcccg AgccGaaacc tgttCCcctg 

1201 tggaAAAaac tcgCGCCCGC CGCcatcgCC GCCGTATTGG tgctgtgGgt 

1251 cggCgcggtc gcacccgcag CATTCTTGAA CCACTTTATC GTCTTCGTCC 

1301 TCGCCTGCGT CATCGGCTAC CATGTCGTTT GgaacgTCAG CCACTCGCTG 

1351 CACACACCGC TGAtgtcggt aaccaaCgcc atctccGGCA tcatggtcgt 

1401 cggCGCGCTG CTGCAAATCG GTCAGGGcaa cggcttcgtT TCgctGCTGT 

1451 CGTTTGTTGC CATCCTGATT GCCGGCATCA ATATCTTCGG CGGCTTTGCG 

1501 GTTACACGGC GTATGCTGAA TATGTTTAAG AAAGGGTAA 

This corresponds to the amino acid sequence <SEQ ID 638; ORF 155.ng>: 

gl55.pep 

1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QTAGATVADK AAVWACPLIY KVNAPSEGEL PLLKEGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 

151 RFFTGQITAA GKVPPA QVLV IGAGVAGLAA IGTAN SLGAV VRAFDTRLEV 

201 AEQIESMGGK FLKLDFLQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAATGG NCELTRPGEL 

301 SVTGNGVKII GYTDMANRLA GQSSQLYATN LVNLTKLLSP NKDGEITLDF 

351 EDVIIRNMTV TRDGEITFPP PPIQVSARPQ QTPSEKAAPA AKPEPKPVPL 

401 WK KLAPAAIA AVLVLWVGAV AP 7VAFLNHFI VFVLACVIG Y HWWNVSHSL 

451 HTP LMSVTNA ISGIMWGAL LQIGQGNG FV SLLSFVAILI AGINI FGGFA 

501 VTRRMLNMFK KG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 639>: 

ml55 . seq 

1 ATGAAAATCG GTATCCCACG CGAGTCATTA TCCGGCGAAA CCCGCGTCGC 

51 CTGTACGCCC GCCACCGTCG CCCTGCTGGG CAAACTGGGC TTTGAAACCG 

101 TTGTCGAAAG CGGTGCAGGT TTGGCGGCAA GTTTGGACGA TGCCGCTTAC 

151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGTCTGCCC 

201 TTTGATTTAT AAGGTCAACG CGCCGTCCGA ACAGGAACTG CCGCTTTTGA 

251 ACGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 
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301 TTGGTCGAAG CCTTGCGCGC CAAGAAAGTG AACGCGCTGG CGATGGATAT 

351 GGTGCCCCGC ATTTCGCGCG CGCAGGCTTT GGACGCTTTG TCTTCGATGG 

401 CAAACATCAG CGGCTACCGC GCCGTAATTG AAGCCGCCAA CGCCTTCGGC 

451 CGTTTCTTCA CCGGTCAAAT TACCGCCGCC GGCAAAGTGC CGCCCGCGCA 

501 GGTTTTGGTG ATTGGTGCAG GTGTGGCAGG TTTGGCGGCG ATCGGTACGG 

551 CAAACTCGCT CGGCGCAGTG GTACGCGCGT TCGATACCCG CTTGGAAGTG 

601 GCGGAACAAA TCGAATCGAT GGGCGGCAAG TTCCTGAAAC TCGACTTCCC 

651 ACAAGAATCG GGCGGCAGCG GAGACGGCTA CGCCAAAGTG ATGAGCGACG 

701 AATTTATCGC AGCCGAGATG AAGCTCTTTG CCGAGCAGGC GAAAGAAGTG 

751 GACATCATCA TCACCACCGC CGCCATTCCG GGCAAACCCG CGCCCAAGCT 

801 GATTACCAAA GAAATGGTGG AAAGCATGAA ATCCGGCTCC GTCATCGTCG 

851 ATTTGGCGGC GGCGACGGGC GGCAACTGCG AACTCACCCG CCCGGGCGAA 

901 TTGTCCGTAA CCGGCAACGG CGTGAAAATC ATCGGCTACA CCGACATGGC 

951 AAACCGCCTT GCCGGACAGT CTTCCCAGCT TTACGCCACC MCTTGGTCA 

1001 ACCTGACCAA GCTGTTAAGC CCGAACAAAG ACGGCGAAAT CACGTTGGAC 

1051 TTCGAAGACG TGATTATCCG CAACATGACC GTTACCCACG ACGGCGAAAT 

1101 CACCTTCCCG CCTCCGCCGA TTCAAGTTTC CGCCCAGCCG CAGCAAACGC 

1151 CGTCTGAAAA AGCCGTGCCT GCCGCCAAGC CCGAGCCAAA ACCCGTTCCC 

1201 CTGTGGAAAA AACTCGCGCC CGCCGTCATC GCCGCCGTCT TGGTACTGTG 

1251 GGTCGGCGCG GTCGCACCCG CAGCATTCCT GAACCACTTT ATCGTGTTCG 

1301 TTCTCGCCTG CGTCATCGGC TACTACGTCG TCTGGAACGT CAGCCACTCG 

1351 CTGCACACAC CGCTGATGTC GGTAACCAAC GCCATCTCCG GCATCATCGT 

1401 GGTCGGCGCG CTGCTGCAAA TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 

1451 TGTCGTTTGT TGCCATCCTG ATTGCCGGCA TCAACATCTT CGGCGGCTTT 

1501 GCGGTAACAC GGCGTATGCT GAATATGTTT AAGAAAGGGT AA 



This corresponds to the amino acid sequence <SEQ ID 640; ORF 155>: 

ml 55. pep 

1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QTAGATVADK AAVWVCP LIY KVNAPSEQEL PLLW EGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 

151 RFFTGQITAA GKVPPAQVLV IGAGVAGLAA IGTANSLGAV VRAFDTRLEV 

201 AEQIESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAAATG GNCELTRPGE 

301 LSVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTHDGEITFP PPPIQVSAQP QQTPSEKAVP AAKPEPKPVP 

401 LWKK LAPAVI AAVLVLWVGA VA P AAFLNHF IVFVLACVIG YYWWNVSHS 

451 LHTP LMSVTN AISGIIWGA L LQIGQGNG F VSLLSFVAIL lAGINI FGGF 

501 AVTRRMLNMF KKG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 155 shows 97.9% identity over a 513 aa overlap with a predicted ORF (ORF 155.ng) 
from N. gonorrhoeae: 

ml55 / gl55 97.9% identity in 513 aa overlap 

10 20 30 40 50 60 

ml55 . pep MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 
I I I I I I t i I I I I I I I I I I I I I I I I I I I I I I i 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gl55 MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 55 . pep AAVWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 

illl:IMIilllllll i I I I I : I I I I I I I I I I i I I I It I I I I I I I I I I I I I II 

gl55 AAVWACPLIYKVNAPSEGELPLLECEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml55.pep ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 
I I I M I I I M I II I I I I II I I I I I It 1 I II I I I t I I I II t I I I I I I I I I II I I I I I I II I 
gl55 ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 
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130 140 150 160 170 180 

190 200 210 220 230 240 

ml55 . pep IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 

IIIIIIIIIIIIIMIIIIIIIIMM Ill I I I I I I I 1 I I I I I I I I I 1 ] I I ) I 

gl55 IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFLQESGGSGDGYAKVMSDEFIAAEM 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml55 . pep KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAAATGGNCELTRPGE 
I I I t I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I 1 t I I I I I I It I I I I I I I I I I I I 
gl55 KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAA-TGGNCELTRPGE 

250 260 270 280 290 



310 320 330 340 350 360 

ml55 . pep LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 
I i I I i I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I i i I I I I 
gl55 LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 
300 310 320 330 340 350 



370 380 390 400 410 420 

ml55.pep VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVIAAVLVLWVGA 
I i : I I I I I M I I I I I i I I : I I I I I I I I I : I I 1 i I I I I I I I I I I I I I I I : I I I I I I ] 1 I I I 
gl55 VTRDGEITFPPPPIQVSARPQQTPSEKAAPAAKPEPKPVPLWKKLAPAAIAAVLVLWVGA 
360 370 380 390 400 410 



430 440 450 460 470. 480 

ml55 . pep VAPAAFLNHFIVFVLACVIGYYWWNVSHSLHTPLMSVTNAISGIIWGALLQIGQGNGF 
I I I I I I I I I I I I I I I I I I I I I : I i I I i I 1 I I I I I t I I I I I I I I I I : I I I M I I I I I I I I I 
gl55 VAPAAFLNHFIVFVLACVIGYHWWNVSHSLHTPLMSVTNAISGIMWGALLQIGQGNGF 
420 430 440 450 460 470 



490 500 510 

ml55 . pep VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX 
I I I I I I I I 1 I I I I I I I I I I I t I I I I I I I I I I I I I 
gl55 VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX 
480 490 500 510 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 641>: 

al55 . seq 

1 ATGAAAATCG GTATCCCACG TGAGTCATTA TCCGGCGAAA CCCGCGTCGC 

51 CTGTACGCCC GCCACCGTCG CCCTGCTGGG CAAACTGGGC TTTGAAACCG 

101 TTGTCGAAAG CGGCGCAGGT TTGGCGGCAA GTTTGGACGA TGCCGCTTAC 

151 CAAGCAGCAG GCGCAACCGT TGCCGACAAA GCAGCGGTTT GGGCATACCC 

201 TTTAATTTAT AAGGTTAACG CGCCGTCCGA AGACGAGCTG CCGCTGCTCA 

251 AAGAAGGACA GACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 

301 TTGGTCGAAG CCTTGCGCGC CAAGAAAGTG AACGCGCTGG CAATGGACAT 

351 GGTGCCCCGC ATTTCGCGCG CGCAGGCTTT GGACGNTTTG TCTTNGATGG 

401 CAAACATCAG CGGCTACCGC GCCGTGATTG AAGCCGCCAA CGCCTTCGGC 

451 CGTTTNTTCA CCGGCCAAAT TACTGCCGCA GGCAAAGTGC CGCCCGCGCA 

501 GGTTTTGGTG ATTGGTGCAG GTGTGGCAGG TTTGGCGGCG ATCGGTACGG 

551 CAAACTCGCT CGGCGCAGTG GTACGCGTGT TCGATACCCG CCTG.T^GTG 

601 GCGGAACAAT TAGAATCGAT GGGCGGCAAG TTCCTGAAAC TCGACTTCCC 

651 GCAAGAATCG GGCGGCAGCG GCGACGGCTA CGCCAAAGTG ATGAGCGACG 

701 AATTTATCGC CGCCGAGATG AAGCTTTTTG CCGAGCAGGC GAAAGAAGTG 

751 GACATCATCA TCACCACCGC CGCCATTCCG GGCAAACCCG CGCCCAAGCN 

801 NNTNANCAAA GAAATGGTCG AAAGCATGAA ACCCGGCTCC GTCATCGTCG 

851 ATTTGGCGGC GGCGACGGGC GGCAACTGCG AACTCACCAA ACAGGGCGAA 

901 TTGTTCGTAA CCGGCAACGG CGTGAAAATC ATCGGCTACA CCGACATGGC 

951 AAACCGCCTT GCCGGACAGT CTTCGCAGCT TTACGCCACC AACTTGGTCA 

1001 ACCTGACCAA GCTGTTAAGC CCGAACAAAG ACGGCGAAAT CACGCTGGAC 

1051 TTCGAAGACG TGATTATCCG CAACATGACC GTTACCCGCG ACGGCGAAAT 

1101 CACCTTCCCG CCTCCGCCGA TTCAAGTTTC CGCCCAACCG CAGCAAACGC 

1151 CGTCTGAAAA AGCCGCGCCT GCCGCCAAGC CCGAACCGAA ACCCGTTCCC 



wo 99/57280 




PCTAJS99/09346 



447 



1201 CTGTGGAAAA AACTCGCGCC CGCCNTNATC GCCGCCGTGT TGGTACTGTG 

1251 GGTCGGCGCG GTCGCACCCG CAGCATTCCT GAACCACTTT ATCGTCTTCG 

1301 TCCTCGCCTG CGTCATCGGC TACTATGTCG TTTGGAACGT CAGCCACTCG 

1351 CTGCACACAC CGCTGATGTC GGTGACCAAC GCCATTTCCG GCATCATCGT 

1401 GGTCGGCGCG CTGCTGCAAA TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 

1451 TGTCGTTTGT TGCCATCCTG ATTGCCAGCA TCAACATCTT CGGCGGCTTC 

1501 TTTGTAACGC GGCGGATGCT GAATATGTTT AGGAAAGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 642; ORF 155.a>: 

alSS.pep 

1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QAAGATVADK AAVWAYPLIY KVNAPSEDEL PLLKEGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDXL SXMANISGYR AVIEAANAFG 

151 RXFTGQITAA GKVPPA QVLV IGAGVAGLAA IGTA NSLGAV VRVFDTRLXV 

201 AEQLESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKXXXK EMVESMKPGS VIVDLAAATG GNCELTKQGE 

301 LFVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTRDGEITFP PPPIQVSAQP QQTPSEKAAP AAKPEPKPVP 

401 LWK KLAPAXI AAVLVLWVGA V AP AAFLNHF IVFVLACVIG YYWWNVSHS 

4 51 LHTP LMSVTN AISGIIWGA L LQIGQGNG F VSLLSFVAIL lASINI FGGF 

501 FVTRRMLNMF RKG* 

ml55/al55 95.3% identity in 513 aa overlap 

10 20 30 40 50 60 

inl55 . pep MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 
I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I t I I I I It i I I I t I I I I : I I I i I I I I 
al55 MKXGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQAAGATVADK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml55 . pep AAVWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 
Mil: I I I I I I I I I I I : I II M: I I I I I I I I I I I I I [ I I I I 11 I I I i I i II I I I I I I I I 
al55 AAVWAYPLIYKVNAPSEDELPLLKEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml55 . pep ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 
I M I II II II II I II M I I I II II II t i I I I I I I I I I I I M I I I i M I I I I I I I I I I 
al55 ISRAQALDXLSXMAh?ISGYRAVIE7\ANAFGRXFTGQITAAGKVPPAQVLVIGAGVAGLAA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml55 . pep IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 
t II M I I I I I I I : I I M I llll:lllllllllllllllllllllllllillllllllll 
al55 IGTANSLGAWRVFDTRLXVAEQLESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 

190 200 210 220 230 240 

250 260 270 280 290 300 

miss . pep KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAAATGGNCELTRPGE 

t I I I t I I I I I I I I I I I I I I 1 I M I I I : I M I I I I i I I M I I I I I Ill: II 

al55 KLFAEQAKEVDIIITTAAIPGKPAPKXXXECEMVESMKPGSVIVDLAAATGGNCELTKQGE 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml55 . pep LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 
I I t I I I I I I I I I I t t I I I M I I I I I I I II I I I I I I i I I i I I I I I I I I I I M I tl t I I I I 
al55 LFVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 55 . pep VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVI AAVLVLWVGA 

ll:IMIIIIIIIIIIII]lllllllll:lllllllltlllllll)M IIIIIIIIIM 
al55 VTRDGEITFPPPPIQVSAQPQQTPSEKAAPAAKPEPKPVPLWKKLAPAXIAAVLVLWVGA 

370 390 390 400 410 420 
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430 440 450 460 470 480 

ml 55 . pep VAPAAFLNHFIVFVLACVIGYYWWNVSHSLHTPLMSVTNAISGIIWGALLQIGQGNGF 
I I I I I I M I I It I I I I I I I I I I 1 I I 1 t I I I I I I I I I I t I I I I I I I I I I I ) I I I I I t I I I I 
al55 VAPAAFLNHFIVFVLACVIGYYWWNVSHSLHTPLMSVTNAISGIIWGALLQXGQGNGF 

430 440 450 460 470 480 

490 500 510 

ml55 .pep VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX 
111111111111:1111111 111111111:111 
al55 VSLLSFVAILIASINIFGGFFVTRRMLNMFRKGX 

490 500 510 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 643>: 

gl56.seq 

1 ATGACTTTCG CCTATTGGTG CATTCTGATT GCCTGCCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCTCG CGGTTTTCTG GCACATACGC AAGGCGCAGC CGCCCGTGCC 

151 CACGCCGCGC AGCAA7VACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC CGGACAAGCA ACCGTCAACA 

251 CGCTTGCCGG ATTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAC 

301 ATCGCAGACA AAGCAGCATT GCGCTCGCTG ATGTGGGCGG GCGGATTTGC 

351 CTGCACCGTC GGACTGTTTG TCGCGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 644; ORF 156.ng>: 

gl56.pep 

1 MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGjVAARA 

51 HAAQQNGFEA FAPFAAAVLT AHATGNAGQA TVNT IAGLFI LFRLAFIWCY 

101 lADKAALRS L MWAGGFACTV GLFVAAA * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 645>: 

ml56. seq 

1 ATGACTTTCG CCTATTGGTG TATTCTGATT GCCTGCCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCGCG CGGTTTTCTA GCGCACACGC AAGGCGCAGC CGCCCGTGCC 

151 CACGCCGCAC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC GGCGCAATCG ACCATCAACA 

251 CGCTTGCCTG CCTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAT 

301 ATCGCCGACA AAGCCGCTAT GCGCTCACTG ATGTGGGCAG GCGGATTTGC 

351 CTGCACCGTC GGGCTGTTTG TCGCGGCTGC TTGA 

This corresponds to die amino acid sequence <SEQ ED 646; ORF 156>: 

ml56.pep 

1 MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGAAARA 

51 HAAQQNGFEA FAPFAAAVLT AHATGNAAQS TINT LACLFI LFRLAFIWCY 

101 lADKAAMRS L MWAGGFACTV GLFVAA A* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 



ml56 / gl56 96.1% identity in 127 aa overlap 

10 20 30 40 50 60 

ml56 . pep MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGA7W\RAHAAQQNGFEA 
IIMIMIIIllMlitllllllllillltMIIIIIIIIIMMIIIIIIIIIIIIIII 
gl56 MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA 

10 20 30 40 50 60 



nil56.pep 



70 80 90 100 110 120 

FAPFAAAVLT/VHATGNAAQSTINTLACLFILFRLAFIWCYIADKAAMRSLMWAGGFACTV 
MIIIIMIIIIII|I|:|:|:M)I I I I I I I I I I I I I ) I I I I I I : I I I I I I I I j | | ] | 
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gl56 



FAPFAAAVLTAHATGNAGQATVNTLAGLFILFRLAFIWCYIADKAALRSLMWAGGFACTV 
70 80 90 100 110 120 



ml56,pep 
gl56 



GLFVAAAX 
I I I I I I I I 
GLFVAAAX 



The following partial DNA sequence was identified in K meningitidis <SEQ E) 647>: 



al56. seq 
1 
51 
101 
151 
201 
251 
301 
351 



ATGACTTTCG 
TTGTGCGGCG 
ACAATCCGCG 
CACGCCGCGC 
CGTTTTGACG 
CGCTTGCCGG 
ATCGCAGACA 
CTGCACCGTC 



CCTATTGGTG 
TATGCCAAAA 
CGATTTTCTG 
AGCAAMCGG 
GCACACGCAA 
CCTGTTCATC 
AAGCAGCATT 
GGGCTGTTTG 



TATTCTGATT 
AAGCGGGCGG 
GCGCGCACGC 
JTTTGAAGCC 
CCGGCAATGC 
CTGTTCCGCC 
ACGCTCGCTG 
TCGTGGCTGC 



GCCTACCTAT 
ATTCCGGTTT 
AAGGCACAGC 
TTTGCACCGT 
CGGACAAGCA 
TCGCCTTTAT 
ATGTGGGTGG 
TTGA 



TGCCGCTTTT 
AAAGACAACC 
CGCCCGTGCC 
TTGCAGCCGC 
ACCGTCAACA 
CTGGTGCTAC 
GCGGATTTGT 



This corresponds to the amino acid sequence <SEQ ID 648; ORF 156.a>: 

al56 .pep 

1 MTFAYWCILI AYLLPLFCAA YAKKAGGFRF KDNHNPRDFL ARTQGTAARA 

51 HAAQ QNGFEA FAPFAAAVLT AHATGNAGQA TVNT LAGLFI LFRLAFIWCY 

101 lADKAALRS L MWVGGFVCTV GLFWAA * 

ml56/al56 90.6% identity in 127 aa overlap 

10 20 30 40 50 60 

mlSe.pep MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA 
I I i I I I I I 11 I I I I I I I I I i I I I I M I I I ! I I I I I I I 1 I : I I I : I I I t I I I I I I I I I i 
al56 MTFAYWCILIAYLLPLFCAAYAKKAGGFRFKDNHNPRDFLARTQGTAARAHAAQQNGFEA 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml56 . pep FAPFAAAVLTAHATGNAAQSTINTLACLFILFRLAFIWCYIADKAAMRSLMWAGGFACTV 
I I i I I M I I i I I ) I I I I : I : ) : I I I I I I I I I I t i I I I I I i I I I I I : I I I I I : I I I : I I I 
al56 FAPFAAAVLTAHATGNAGQATVNTLAGLFILFRLAFIWCYIADKAALRSLMWVGGFVCTV 

70 80 90 100 110 120 



ml 5 6 . pep GLFVAAAX 
1111:111 
al56 GLFWAAX 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 649>: 

gl57 . seq 

1 atgaggaacg aggAAAAACg cgccctgcgc cgcgaattgC gCgGgcggcg 

51 ttcgcAAATg GGgcgagacg tGCGggCGGC GGCGgCgatA Aaaatcaacc 

101 gcctgctcaa aCGTtatatc AAGCGCggtc gGaAaatcgG CGTGTATTgg 

151 cCGATGGGCA AGGAATTGcg TTTGGGCGgc tTtgtcCGCG CGGCGC7VAAA 

201 ACGCgGCGCA AAactCtatC tgccttATAT CGAACCGCAC ACGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GAACGCGGAA TGGAACGGGA ACGCAAGCGC 

301 GGTAGGGCGA AGCTGCATGT CCCTCAGTTT GCAGGGCGCA AAATCCGCGT 

351 GCACGGTTTG TCGGTATTGC TCGTCCCGCT TGTCGGCATA GACCGCGAAG 

401 GCTACCGTTT GGGGCAGGCA GGCGGCTATT ACGATGCGAC GCTTTCGGCG 

451 ATGAAATACC GTTTGCAGGC GAA7UVCCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTGGTGGAC AGGCTCCCAC GCGAGGCGCA CGACCTGCCG CTGGACGGTT 

551 TTGTATCGGA AGCGGGGATA TTGTGTTTTT AG 
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This corresponds to the amino acid sequence <SEQ ID 650; ORF I57.ng>: 

gl57.pep 

1 MRNEEKRALR RELRGRRSQM GRDVRAAAAI KINRLLKRYI KRGRKIGVYW 

51 PMGKELRLGG FVRAAQKRGA KLYLPYIEPH TRRMWFTPYP ERGMERERKR 

101 GRAKLHVPQF AGRKIRVHGL SVLLVPLVGI DREGYRLGQA GGYYDATLSA 

151 MKYRLQAKTV GVGFACQLVD RLPREAHDLP LDGFVSEAGI LCF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 651>: 

ml57 . seq 

1 ATGAGGAACG AGGT^AAAACG CGCCCTGCGC CGCGAATTGC GCGGGCGGCG 

51 TTCGCAAATG GGGCGGGACG TGCGGGCGGC GGCAACGGTA AAAATCAACC 

101 ACCTGCTCAA ACGTTATATT AAAAAAGGGC GGAAAATCGG CGTGTATTGG 

151 CCGATGGGCA AGGAATTGCG TTTGGACGGC TTTGTCCGCG CGGCGCAAAA 

201 ACGCGGTGCG GAACTCTACC TGCCTTATAT CGAACCGCGT TCGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GCCGATGGAG TAAAACAAGA ACGCAAGCGC 

301 GGTAGGGCGA AGCTGCATGT CCCTCAGTTT GCAGGTCGGA AAAAGCGTGT 

351 GCATGATTTG AACCTCCTGC TTGTGCCAGT GGTCGGTATG GACAGGCTGG 

401 GCTACCGCTT GGGACAGGCA GGCGGCTATT ACGATGCGAC GCTTTCAGCG 

451 ATGAAATACC GTTTGCAGGC AAAAACCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTGGTGGAC AGGCTGCCGG TCGAGGCGCA CGACCGGTCT TTGGACGGTT 

551 TTGTGTCGGA GGCGGGGATA TTGTGTTTTT AG 



This corresponds to the amino acid sequence <SEQ ED 652; ORF 157>: 

ml 57 .pep 

1 MRNEEKRALR RELRGRRSQM GRDVRAAATV KINHLLKRYI KKGRKIGVYW 

51 PMGKELRLDG FVRAAQKRGA ELYLPYIEPR SRRMWFTPYP ADGVKQERKR 

101 GRAKLHVPQF AGRKKRVHDL NLLLVPWGM DRLGYRLGQA GGYYDATLSA 

151 MKYRLQAKTV GVGFACQLVD RLPVEAHDRS LDGFVSEAGI LCF* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF firom N. gonorrhoeae 

mis? / gl57 88.1% identity in 193 aa overlap 



10 20 30 40 50 60 

ml57 . pep MRNEEKRALRRELRGRRSQMGRDVRAAATVKINHLLKRYIKKGRKIGVYWPMGKELRLDG 
II Ml I IN II I II 11 11-1 III I II |||::|M: III I II 1:111 I II I II ill I II I I 
gl57 MRNEEKRALRRELRGRRSQMGRDVRAAAAIKINRLLKRYIKRGRKIGVYWPMGKELRLGG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 57 . pep FVRAAQKRGAELYLPYIEPRSRRMWFTPYPADGVKQERKRGRAKLHVPQFAGRKKRVHDL 
IIMIIIIII:|lllllll::||||ltll| I ::: II I I I I I i II I II I I I I I Ml | 
gl57 FVRAAQKRGAKLYLPYIEPHTRRMWFTPYPERGMERERKRGRAKLHVPQFAGRKIRVHGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 57 . pep NLLLVPWGMDRLGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPVEAHDRS 

- It I I : I I : I I I I I I I I I I I I I I I I I II 11 I I I I I I I I I I I I II I I I I I I I I I I I 
gl57 SVLLVPLVGIDREGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPREAHDLP 

130 140 150 160 170 180 



190 

ral57 . pep LDGFVSEAGI LC FX 

I I II I I I I II I I I I 
gl57 LDGFVSEAGILCFX 
190 

The following partial DNA sequence w^as identified in A^. meningitidis <SEQ ID 653>: 

al57.seq 

1 ATGAGGAACG AGGAAAAACA CGCCTTGCGC CGAGAGTTGC GCCGCGCCCG 
51 CGCGCAGATG GGGCATCAAG GGCGGTTGGC GGCGGGGCAA ACGATTAACC 
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101 GCCTGCTCAA ACGTTATATC AAGCGTGGTC GGAAAATCGG CGTGTATTGG 

151 CCGATGGGCA AGGAATTGCG TTTGGACGGC TTTGTCCGCG CGGCGCAAAA 

201 ACGCGGTGCA AAACTTTATC TGCCTTATAT CGAACCGCGT TCGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GAAAGCGGAA TGGAACGGGA GCGCATACGG 

301 GGCAGGGCGA AGTTGAACGT GCCGCAGTTT GCAGGGCGCA AAATCCGCGT 

351 GCACGGTTTG TCGGTATTGC TCGTCCCGCT TGTCGGCATA GACCGCGAGG 

401 GCTACCGCTT AGGACAGGCA GGCGGCTATT ACGATGCGAC GCTTGCGGCG 

451 ATGAAATACC GTTTGCAGGC AAAAACCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTTGTGGAC AGGCTGCCGC GCGAACCGCA CGATCTGCTG CTGGACGGTT 

551 TTGTGTCGGA GGCGGGGATA TTGTGCTTTT AG 



This corresponds to the amino acid sequence <SEQ ID 654; ORF 157.a>: 

al57 .pop 

1 MRNEEKHALR RELRRARAQM GHQGRLAAGQ TINRLLKRYI KRGRKIGVYW 

51 PMGKELRLDG FVRAAQKRGA KLYLPYIEPR SRRMWFTPYP ESGMERERIR 

101 GRAKLNVPQF AGRKIRVHGL SVLLVPLVGI DREGYRLGQA GGYYDATLAA 

151 MKYRLQAKTV GVGFACQFVD RLPREPHDLL LDGFVSEAGI LCF* 

inl57/al57 82.4% identity in 193 aa overlap 



10 20 30 40 50 60 

ml 57 . pep MRNEEKRALRRELRGRRSQMGRDVRAAATVKINHLLKRYIKKGRKIGVYWPMGKELRLDG 
I I I I i I : I I I I I I [ I : I I ) : : I II I I : I I II I II : I II I II I II II i 1 II 1 I I 
al57 MRNEEKHALRRELRRARAQMGHQGRLAAGQTINRLLKRYIKRGRKIGVYWPMGKELRLDG 

10 20 30 40 50 60 



70 80 90 100 110 120 

inl57 . pep FVRAAQKRGAELYLPYIEPRSRRMWFTPYPADGVKQERKRGRAKLHVPQFAGRKKRVHDL 
I I I I I I I I I I : I I II I I I I I I I I I I II I I I : I : : : I I I I I I I I : I I I I I I I I III I 
al57 FVRAAQKRGAKLYLPYIEPRSRRMWFTPYPESGMERERIRGRAKLNVPQFAGRKIRVHGL. 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml57 .pep NLLLVPWGMDRLGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPVEAHDRS 

: : I I I 1 : I I : I I I I I I M I M I I I II I : I I I I I t I I I I I I I I I I I 1 : M I I I I II 
al57 SVLLVPLVGIDREGYRLGQAGGYYDATLAAMKYRLQAKTVGVGFACQFVDRLPREPHDLL 

130 140 150 160 170 180 



190 

ml57 .pep LDGFVSEAGILCFX 
I I I I I I I I I I I I I I 
al57 LDGFVSEAGILCFX 
190 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 655>: 

gl58 . seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGgagcAGTT GGAGAtggCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGCGT GAAcCTGCtc 

151 aACCGCACCA CGCGGCAACT CAATCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAGTA CCGCAAGGCG TGTTGCGCGT GGATTCCGCG 

301 ATGCcgatgg TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG . CATATCcgaC TTTCGCTCGT TTCTTCCGAa ggctatatca 

401 atctGattGA Acgcaaagtc gAtatTGCCT TACGGGCCGG AGAATTGGAC 

4 51 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCACT TCCGCGtagt 

501 cgCCAGTCCT GAATATTTAG CAAAACACGG CACGCCACAA TCTGCAGAAG 

551 atcTTGCCAA CCATCAATGT TTAGGCTTCA CAGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGAtgC GCAGGGAAAT CCCTATAAAA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAAGtt 

701 gCGGTATTGC TTGCTTATCA GATTTTTTGG TTGACAACGA CATCACTGAA 

751 GGAAAGTTAA TTCCcctatt cgCCGAACAA ACCTCCAATA AAACACACCC 
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801 CTTT7VATGCT GTTTATTACA GCGATAAAGC CGTCAACCTC CGCTTACGCG 
851 TATTTTTGGA TTTTTTAGTG AAGGAACTGG GAAAAAATAT GAATAGAACG 
901 AATACCAAAT AA 

This corresponds to the amino acid sequence <SEQ E) 656; ORF 158.ng>: 

gl58 .pep 

1 MKTNSEELTV FVQVVESGSF SRAA EQLEMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLNLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEV PQGVLRVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSHFRWASP EYLAKHGTPQ SAEDLANHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEIL RSLC LSSCGIACLS DFLV DNDITE 

251 GKLIPLFAEQ TSNKTHPFNA VYYSDKAVNL RLRVFLDFLV KELGKNMNRT 

301 NTK* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 657>: 

ml58 .seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGGAGCAGTT GGCGATGGCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGTGT GAACCTGCTC 

151 AACCGCACCA CGCGGCAACT CAGTCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAATA CCGCAAGGCG TGTTGAGCGT GGATTCCGCG 

301 ATGCCGATGG TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCCGAC TTTCGCTCGT TTCTTCCGAA GGCTATATCA 

401 ATCTGATTGA ACGCAAAGTC GATATTGCCT TACGGGCCGG AGAATTGGAC 

451 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCGCT TCCGCGTAAT 

501 CGCCAGTCCT GAATACCTGG CAAAACACGG CACGCCGCAA TCTACAGAAG 

551 AGCTTGCCGG CCACCAATGT TTAGGCTTCA CCGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGATGC GCAGGGAAAT CCCTATAAGA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAGGTT 

701 GCGGTATTGT TTGCTTATCA GATTTTTTGG TTGACAACGA CATCGCTGAA 

751 GGAAAGTTAA TTCCCCTGCT CGCCGAACAA ACCTCCGATA AAACACACCC 

801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAATCTC CGCTTACGCG 

851 TATTTTTGGA TTTTTTAGTG GAGGAACTGG GAAACAATCT CTGTGGATAA 

This corresponds to the amino acid sequence <SEQ ID 658; ORF 158>: 

ml 58 .pep 

1 MKTNSEELTV FVQVVESGSF SRAA EQLAMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLSLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEI PQGVLSVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSRFRVIASP EYLAKHGTPQ STEELAGHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEIL RSLC LSGCGIVCLS DFLV DNDIAE 

251 GKLIPLLAEQ TSDKTHPFNA VYYSDKAVNL RLRVFLDFLV EELGNNLCG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

94,3% identity in 297 aa overlap 

10 20 30 40 50 60 

MKTNSEELTV FVQWESGSFSRATVEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
I I I I I I I I I I I I I [ I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I M [ I I I I I I I : I I 
MKTNSEELTVFVQVVESGSFSRAAEQLEMANSAVSRIVKRLEEKLGVNLLNRTTRQLNLT 
10 20 30 40 50 60 

70 80 90 100 110 120 

EEGAQYFRRAQRILQEMAAAETEMLAVHEIPQGVLSVDSAMPMVLHLLAPLAAKFNERYP 
i t I I I I I I I I I I I I I I I I I I I I I I I I M I : I t I i I I I I I I I I I I I I I I I I t I I I I I I I I 
EEGAQYFRRAQRILQEMAAAETEMLAVHEVPQGVLRVDSAMPMVLHLLAPLAAKFNERYP 
70 80 90 100 110 120 



ml58 / gl58 

ml58 .pep 
gl58 

ml 58 .pep 
gl58 
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ml58 .pep 
gl58 

ml58 .pep 
gl58 

ml 58 .pep 
gl58 

gl58 



130 140 150 160 170 180 

HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I ) I I I I I ) I I I I : I I I : I ) I I I t I 11 I I i I 
HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSHFRWASPEYLAKHGTPQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
l:|:tl:||]|{|||llllllllltllllllllllllllllllllllll|||:|||:|)| 
SAEDLANHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSSCGIACLS 

190 200 210 220 230 240 

250 260 270 280 290 300 

DFLVDNDIAEGKLIPLLAEQTSDKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 

DFLVDNDITEGKLIPLFAEQTSNKTHPFNAVYYSDKAVNLRLRVFLDFLVKELGKNMNRT 
250 260 270 280 290 3O0 

NTKX 



The following partial DNA sequence was 

al58 . seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA 

51 CGGCAGCTTC AGCCGTGCGG CGGAGCAGTT 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA 

151 AACCGCACCA CGCGGCAACT CAGTCTGACG 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT 

251 TGCTGGCAGT GCACGAAATA CCGCAAGGCG 

301 ATGCCGATGG TGCTGCATCT GCTGGCGCCG 

351 ACGCTATCCG CATATCCGAC TTTCGCTCGT 

401 ATCTGATTGA ACGCAAAGTC GATATTGCCT 

451 GATTCCGGGC TGCGTGCACG CCATCTGTTT 

501 CGCCAGTCCT GAATACCTGG CAAAACACGG 

551 AGCTTGCCGG CCACCAATGT TTAGGCTTCA 

601 ACATGGGCGG TTTTAGATGC GCAGGGAAAT 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG 

701 GCGGTATTGC TTGCTTATCA GATTTTTTGG 

751 GGAAAGTTAA TTCCCCTGCT CGCCGAACAA 

801 CTTTAATGCT GTTTATTACA GCGATAAAGC 

851 TATTTTTGGA TTTTTTAGTG GAGGAACTGG 



identified in N. meningitidis <SEQ ID 659>: 



TTTGTTCAAG 
GGCGATGGCA 
AGTTGGGTGT 
GAAGAAGGCG 
GGCAGCGGCG 
TGTTGCGCGT 
CTGGCAGCAA 
TTCTTCCGAA 
TACGGGCCGG 
GACAGCCGCT 
CACGCCGCAA 
CCGAACCCGG 
CCCTATAAGA 
CTCGTTGTGC 
TTGACAACGA 
ACCTCCAATA 
CGTCAACCTC 
GAAACAATCT 



tggtggaaag 
aattctgccg 
gaacctgctc 
cgcaatattt 
gaaaccg;^ 
ggattccgcg 
aattcaacga 
ggctatatca 
agaattggac 
tccgcgtaat 
tctacagaag 
ttctctaaat 
tttcaccgca 
ctttcaggtt 
catcgctgaa 
aaacgcaccc 
cgcttacgcg 
ctgtggataa 



This corresponds to the amino acid sequence <SEQ JD 660; ORF 158.a>: 



al58.pep 

1 
51 
101 
151 
201 



MKTNSEELTV FVQWESGSF SRAAEQLAMA 



NRTTRQLSLT EEGAQYFRRA 
MPMVLHLLAP LAAKFNERYP 
DSGLRARHLF DSRFRVIASP 
TWAVLDAQGN PYKISPHFTA 



QRILQEMAAA 
HIRLSLVSSE 
EYLAKHGTPQ 
SSGEILRSLC 



NSAVSRIVKR 
ETEMLAVHEI 
GYINLIERKV 
STEELAGHQC 
LSGCGIACLS 



LEEKLGVNLL 
PQGVLRVDSA 
DIALRAGELD 
LGFTEPGSLN 
DFLVDNDIAE 



251 GKLIPLLAEQ TSNKTHPFNA VYYSDKAVNL RLRVFLDFLV EELGNNLCG* 
ml58/al58 99.0% identity in 299 aa overlap 



10 20 30 40 50 60 

ml58 . pep MKTNSEELTVFVQWESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
IIIIMIIIIIlllllllllllllllllllllMllllMllllllllllllllllllil 
al58 MKTNSEELTVFVQWESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 

10 20 30 40 50 60 

70 80 90 100 110 120 

miss . pep EEGAQYFRRAQRILQEMAAAETEMLAVHEIPQGVLSVDSAMPMVLHLLAPLAAKFNERYP 
I 1 I I I I M I I I I I I I I I I I I I I I I 1 I I i I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a 1 5 8 EEGAQYFRRAQRI LQEMAAAETEMLAVHE I PQGVLRVDS AM PMVLHL LAP LAAKFNERYP 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml58 . pep HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I i I I I I I I M I I I I I M I I I M I I I I I 
a 1 5 8 HIRLSLVS SEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIAS PE YLAKHGT PQ 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml58 . pep STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I j I I I I : I I I 
al58 STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIACLS 

190 200 210 220 230 240 



250 260 270 280 290 300 

miss . pep DFLVDNDIAEGKLIPLLAEQTSDKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 
I I M I I i I I I I I I I I I I I 11 M : I I i I I I I I I I i I I I I I I I I I 1 I I I I i I I I I I I I I I I I 
al58 DFLVDNDIAEGKLIPLLAEQTSNKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 

250 260 270 280 290 300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 661>: 

gl60 . seq 

1 ATGGAcattc tgGACAAact ggtcgatCTC GCccaATTGA CGGGCAGTGC 

51 GGATGTGCAG TgcctTTTGG GCGGACTW^TG gcATGaaacc TTGCAACGCG 

101 AAGGGCTGGT ACACATTGTT ACGGCGGGCA GCGGTTATCT CTGCATCGAC 

151 GGCGAAACTT CCCCGCGTCC GGTCGGCACG GGCGATATTG TATTTTTCCC 

201 GCGCGGCTTG GGTCATGTGT TGAGCCACGA CGGAAAATAC GGAGAAAGTT 

251 TACAACCGGA CATACGACAA AACGGCACAT TTATGGTCAA ACAGTGCGGC 

301 AACGGGCTGG ATATGAGCCT GTTTTGCGCC CGTTTCCGCT ACGACACCCA 

351 CGCCGATTTG ATGAACGGGC TGCCGGAAAC CGTTTTTCTG AACATTGCCC 

401 ATCCAAGTTT GCAGTATGTG GTTTCAATGC TGCAACTGGA AAGCGAAAAA 

451 CCTTTGACGG GGACGGTTTC CGTGGTCAAC GCATTACCGT CCGTGCTGCT 

501 GGTGCTTATC CTGCGCGCCT ATCTCGAACA GGATAAGGAT GTCGAACTCT 

551 CGGGCGTATT GAAAGGTTGG CAGGACAAAC GTTTGGGACA TTTGATCCAA 

601 AAGGTGATAG ACAAACCGGA AGACGAATGG AATATTGACA AAATGGTTGC 

651 CGCCGCCAAT ATGTCGCGCG CGCAACTGAT GCGCCGCTTC AAAAGCCAAG 

701 TCGGACTCAG CCCGCACGCC TTTGTGAACC ATATCCGCCT GCAAAAAGGC 

751 GCATTGCTGC TGAAGAAAAC CCCGGATTCG GTTTTGGAGG TCGCGCTGTC 

801 GGTGGGCTTT CAGTCGGAAA CGCATTTCGG CAAGGCGTTC AAACGGCAAT 

851 ATCACGTTTC GCCGGGGCAA TACCGGAAAG AAGGCGGGCA AAAATAA 



This corresponds to the amino acid sequence <SEQ ID 662; ORF 160.ng> : 

gieo.pep 

1 MDILDKLVDL AQLTGSADVQ CLLGGQWHET LQREGLVHIV TAGSGYLCID 

51 GETSPRPVGT GDIVFFPRGL GHVLSHDGKY GESLQPDIRQ NGTFMVKQCG 

101 NGLDMSLFCA RFRYDTHADL MNGLPETVFL NIAHPSLQYV VSMLQLESEK 

151 PLTG TVSWN ALPSVLLVLI L RAYLEQDKD VELSGVLKGW QDKRLGHLIQ 

201 KVIDKPEDEW NIDKMVAAAN MSRAQLMRRF KSQVGLSPHA FVNHIRLQKG 

251 ALLLKKTPDS VLEVALSVGF QSETHFGKAF KRQYHVSPGQ YRKEGGQK* 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 663>: 

mieo. seq 

1 ATGGACATTC TGGACAAACT GGTCGATTTC GCCCAATTGA CGGGCAGTGT 

51 GGATGTGCAG TGCCTTTTGG GCGGACAATG GTCGGTACGG CATGAAACCT 

101 TGCAACGCGA AGGATTGGTA CACATTGTTA CATCGGGCAG CGGCTATCTC 

151 TGCATCGACG GCGAAACTTC CCCGCGTCCG GTCAGTACAG GGGATATTGT 

201 ATTTTTCCCG CGCGGCTTGG GTCATGTGTT GAGCCACGAC GGAAAATGCG 

251 GAGAAAGTTT ACAACCGGAT ATGCGGCAGC ACGGTGCGTT TACGGTCAAG 

301 CAGTGCGGCA ACGGACAGGA TATGAGCCTG TTTTGCGCCC GTTTCCGCTA 

351 CGACACCCAC GCCGATTTGA TGAACGGGCT GCCTGAAACC GTTTTTCTGA 

401 ACATTGCCCA TCCGAGTTTA CAGTATGTGG TTTCAATGCT GCAACTGGAA 

451 AGCAAAAAAC CTTTGACGGG GACGGTTTCC ATGGTCAACG CATTGTCGTC 
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501 CGTCCTGCTG GTGCTTATCC TGCGCGCCTA TCTCGAACAG GATAAGGATG 

551 TCGAACTCTC GGGCGTATTG AAAGGTTGGC AGGACAAACG TTTGGGACAT 

601 TTAATCCAAA AGGTGATAGA CAAACCGGAA GACGAATGGA ATGTCGACAA 

651 MTGGTGGCG GCTGCCAATA TGTCGCGCGC GCAACTGATG CGCCGTTTCA 

701 AAAGCCGGGT CGGACTCAGC CCGCACGCCT TTGTGAACCA TATCCGCCTG 

751 CAAAAAGGCG CGTTGCTGCT GAAAAAAAAC CCGGATTCGG TTTTGTCGGT 

801 CGCACTGTCG GTAGGCTTTC AGTCGGAAAC GCACTTCGGC AAGGCGTTCA 

851 AACGGCAATA TCACGTTTCG CCGGGTCAAT ACCGGAAAGA AggCGGGCAA 

901 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 664; ORF 160>: 

ml 60. pep 

1 MDILDKLVDF AQLTGSVDVQ CLLGGQWSVR HETLQREGLV HIVTSGSGYL 
51 CIDGETSPRP VSTGDIVFFP RGLGHVLSHD GKCGESLQPD MRQHGAFTVK 
101 QCGNGQDMSL FCARFRYDTH ADLMNGLPET VFLNIAHPSL QYWSMLQLE 
151 SKKPLT GTVS MVNALSSVLL VLILRAYLEQ DKDVELSGVL KGWQDKRLGH 
201 LIQKVIDKPE DEWNVDKMVA AANMSRAQLM RRFKSRVGLS PHAFVNHIRL 
251 QKGALLLKKN PDSVLSV/O^S VGFQSETHFG KAFKRQYHVS PGQYRKEGGQ 
301 K* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml60 / gl60 93.4% identity in 301 aa overlap 

10 20 30 40 50 60 

ml 60 . pep MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 

I I I 1 I I I I I I I i I I : i I I I I I I I I j t I I I I 
gl60 MDILDKLVDLAQLTGSADVQCLLGGQW HETLQREGLVHIVTAGSGYLCIDGETSPRP 

10 20 30 40 50 

70 80 90 100 110 120 

ml 60 . pep VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 

1:111111111111111 Illllll:lt:i:l llllill I I i I I I I I I I I I I I 

gl60 VGTGDIVFFPRGLGHVLSHDGKYGESLQPDIRQNGTF^4VKQCGNGLDMSLFCARFRYDTH 
60 70 80 90 100 110 

130 140 150 160 170 180 

ml 60 . pep ADLMNGLPETVFLNIAHPSLQYWSMLQLESKKPLTGTVS^5VNALSSVLLVLILRAYLEQ 

) I M I I M I I I I I I I I I I M I I I I I I I I I I I : I I I I I I I I : I f i I I I I I I I I I I I I I I I 
gl 60 ADLMNGLPET VFLN I AH PSLQYWSMLQLESEKPLTGTVSVVNAL PS VLLVL I LRAYLEQ 

120 130 140 150 160 170 

190 200 210 220 230 240 

ml 60 . pep DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKMVAAANMSRAQLMRRFKSRVGLS 

llllillllllllllIIIIIIIIIIIIIIIIIII:tllllllllt(lllll]|l!:|IM 
gl60 DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNIDKMVAAANMSRAQLMRRFKSQVGLS 

180 190 .200 210 220 230 

250 260 270 280 290 300 

ml 60 . pep PHAFVNHIRLQKGALLLKKNPDSVLSVALSVGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 

I I i I 1 I N M I I I I I I I I I : I I I I I I I I I j I I I I I i I i I I I I I I I I I I I I I I i I I I I I I 
gl 60 PHAFVNHIRLQKGALLLKKTPDSVLEVALSVGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 

240 250 260 270 280 290 



ml 60. pep KX 

I I 

gl60 KX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 665>: 

aieo.seq 

1 ATGGACATTC TGGACAAACT GGTCGATTTC GCCCAATTGA CGGGCAGTGT 
51 GGATGTGCAG TGCCTTTTGG GCGGACT^TG GTCGGTACGG CATGAAACCT 
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101 TGCAACGCGA AGGATTGGTA CACATTGTTA CATCGGGCAG CGGCTATCTC 

151 TGCATCGACG GCGAAACTTC CCCGCGTCCG GTCAGTACAG GGGATATTGT 

201 ATTTTTCCCG CGCGGCTTGG GTCATGTGTT GAGCCACGAC GGAAAATGCG 

251 GAGAAAGTTT ACAACCGGAT ATGCGGCAGC ACGGTGCGTT TACGGTCAAG 

301 CAGTGCGGCA ACGGACAGGA TATGAGCCTG TTTTGCGCCC GTTTCCGCTA 

351 CGACACCCAC GCCGATTTGA TGAACGGGCT GCCTGAAACC GTTTTTCTGA 

401 ACATTGCCCA TCCGAGTTTA CAGTATGTGG TTTCAATGCT GCAACTGGAA 

451 AGCAAAAAAC CTTTGACGGG GACGGTTTCC ATGGTCAACG CATTGTCGTC 

501 CGTCCTGCTG GTGCTTATCC TGCGCGCCTA TCTCGAACAG GATAAGGATG 

551 TCGAACTCTC GGGCGTATTG AAAGGTTGGC AGGACAAACG TTTGGGACAT 

601 TTAATCCAAA AGGTGATAGA CAAACCGGAA GACGAATGGA ATGTCGACAA 

651 AATGGTGGCG GCTGCCAATA TGTCGCGCGC GCAACTGATG CGCCGTTTCA 

701 A7UVGCCGGGT CGGACTCAGC CCGCACGCCT TTGTGAACCA TATCCGCCTG 

751 CAAAAAGGCG CGTTGCTGCT GAAA7VAAAAC CCGGATTCGG TTTTGTCGGT 

801 CGCACTGTCG GTAGGCTTTC AGTCGGAAAC GCACTTCGGC AAGGCGTTCA 

851 AACGGCAATA TCACGTTTCG CCGGGTCAAT ACCGGAAAGA AGGCGGGCAA 

901 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 666; ORF 160.a>: 

aieo.pep 

1 MDILDKLVDF AQLTGSVDVQ CLLGGQWSVR HETLQREGLV HIVTSGSGYL 

51 CIDGETSPRP VSTGDIVFFP RGLGHVLSHD GKCGESLQPD MRQHGAFTVK 

101 QCGNGQDMSL FCARFRYDTH ADLMNGLPET VFLNIAHPSL QYWSMLQLE 

151 SKKPLTG TVS MVNALSSVLL VLIL RAYLEQ DKDVELSGVL KGWQDKRLGH 

201 LIQKVIDKPE DEWNVDKMVA AANMSRAQLM RRFKSRVGLS PHAFVNHIRL 

251 QKGALLLKKN PDSVLSVALS VGFQSETHFG KAFKRQYHVS PGQYRKEGGQ 

301 K* 

ml60/al60 100.0% identity in 301 aa overlap 

10 20 30 40 50 60 

MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 
I I I It I I I I j I I I I I I I I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 
10 20 30 40 50 60 

70 80 90 100 110 120 

VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 
I I I I i I I I I I I I I I I I t I I I M I I I I I I I I I I I I I I I I I I t I I I M I I I I I I t I I I I I I I 
VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 
70 80 90 100 110 120 

130 140 150 160 170 180 

ADLMNGLPETVFLNIAHPSLQYWSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 
I I i I M M I I I M I I I I I I I I I I I I I I I I I i 1 I I I I I I I I I I I I I I I I I I I I I M M I I I 
ADLMNGLPETVFLNIAHPSLQYWSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 
130 140 150 160 170 180 

190 200 210 220 230 240 

DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKMVAAANMSRAQLMRRFKSRVGLS 
I it I I I I I I I I I I I I I I [ I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 
DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKb5VAAANMSRAQLMRRFKSRVGLS 
190 200 210 220 230 240 

250 260 270 280 290 300 

PHAFVNHIRLQKGALLLKKNPDSVLSVALSVGFQSETHFGKAFKRQYHVSPGQYRKEGGQ 
I I I I M I I I t I I I II I I I t I I II t I II I I I I t t I I I t I I t II I I II I I I I I I I I I I I I I I 
PHAFVNHIRLQKGALLLKKNPDSVLSVALSVGFQSETHFGKAFKRQYHVSPGQYRKEGGQ 
250 260 270 280 290 300 



ml 60 .pep 
al60 

ml 60 . pep 
al60 

ml 60. pep 
al60 

ml 60, pep 
al60 

ml 60 .pep 
al60 



ml 60 . pep 
al60 



KX 
I I 
KX 
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The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 667>: 

giei.seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTCACCGTTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTACGC TCGGTGCTGC CGCCGTATTG CGGCGCGACA CCTTCCGCAC 

201 GCCCCATTGG AAAAACCACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGAC AACCGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTttg GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

451 CCGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGCAACC GGCGTGGCGA TGTCGTCggt ttgggcgacg 

601 Ctgaccggct ggCACAcccT GTCCTTTcca tcggcagttt ATCtgtCGGG 

651 CATCGGCGTG tccgcgCtgA TTGCCCAaCT GtcgatgAcg cGCGcctaca 

701 aaGTCGGCGA CAAATTCACG GTTGCCTCGC tttCCtaTAt gaccgtcGTC 

751 TTTTCCGCCC TGTCTGCCGC ATTTTTTCTg ggcgaagagc ttttctggCA 

801 GGA7VATACTC GGTATGTGCA TCATTATcct CAGCGGCATT TTGAGCAGCA 

851 TCCGCCCCAT TGCCTTCAAA CAGCGGCTGC AAGCCCTCTT CCGCCAAAGA 

901 TAA 

This corresponds to the amino acid sequence <SEQ ID 668; ORF 161.ng>: 

gl61.pep 

1 MDTAKKDILG SGWMLVAAA C FTVMNVLIKE AST^FALGSG ELVFWRMLFS 

51 TVTLGAAAVL RRDTFRTPHW KNHLNRS MVG TGAMLLLFYA VTHL PLTTGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQ AVL LLGFAGWLL LNPSF RSGQE 

151 PAALAGLAGG AMSGWAYLKV RELSLAGEPG WRVVFYLSAT GVAMSSVWAT 

201 LTGWHTLS FP SAVYLSGIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTW 

251 FSALSAAFFL GEELFW QEIL GMCIIILSGI LSSI RPIAFK QRLQALFRQR 

301 * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 669>: 

ml 61 . seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTTACCATTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTGCGC TCGGGGCTGC CGCCGTATTG CGTCGGGACA mCTTCCGCAC 

201 GCCCCATTGG AAAAACCACT TA7VACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGGC CACTGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTTTG GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

451 ACGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGTGACA GGTGTGGCGA TGTCGTCGGT TTGGGCGACG 

601 CTGACCGGCT GGCACACCCT GTCCTTTCCA TCGGCAGTTT ATCTGTCGTG 

651 CATCGGCGTG TCCGCGCTGA TTGCCCAACT GTCGATGACG CGCGCCTACA 

701 AAGTCGGCGA CAAATTCACG GTTGCCTCGC TTTCCTATAT GACCGTCGTT 

751 TTTTCCGCTC TGTCTGCCGC ATTTTTTCTG GGCGAAGAGC TTTTCTGGCA 

801 GGAAATACTC GGTATGTGCA TCATCATCCT CAGCGGTATT TTGAGCAGCA 

851 TCCGCCCCAC TGCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 

901 TAA 

This corresponds to the amino acid sequence <SEQ ID 670; ORF 161>: 

ml 61 .pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG EL VFWRMLFS 

51 TVALGAAAV L RRDXFRTPHW KNHLNRSMVG TG AMLLLFYA VTHLPLATGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQAVL LLGFAGWLL LNPSFRSGQE 
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151 TAALAGLAGG AMSGWAYLKV RELSLAGEPG WR WFYLSVT GVAMSSVWA T 

201 LTGWHTLS FP SAVYLSCIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTW 

251 FSALSAAFFL GEELFWQ EIL GMCIIILSGI LSSI RPTAFK QRLQSLFRQR 

301 * 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ml61 / gl61 97.0% identity in 300 aa overlap 

10 20 30 40 50 60 

ml 61 . pep MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 
IIIMIIIIIIIIIIIIIIIII:niMtllllllllll|||||||||||||:||||||| 
gl61 MDTAKKDILGSGWMLVAAACFTVMNVLIKEASAKFALGSGELVFWRMLFSTVTLGAAAVL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml61 . pep RRDXFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 

lll-IMMIIMIIIII I I i I I I I f I I I I I : I I I [ I I I I I I I I I I I I I I I I I I I 

gl61 RRDTFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLTTGVTLSYTSSIFLAVFSFLILKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 61 . pep RISVYTQAVLLLGFAGWLLLNPSFRSGQETAALAGLAGGAMSGWAYLKVRELSLAGEPG 
M I I I I I I I I I I I M I I I I I I I I I I 1 I I I I ) I I I I I I I I I I I I I i I I I I I I I I I I I I I I 
gl61 RISVYTQAVLLLGFAGWLLLNPSFRSGQEPAALAGLAGGAMSGWAYLKVRELSLAGEPG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml61 . pep WRWFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 
M I I I I I t : I I I i i I I I I I I I I I M I I i I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I 
gl61 WRWFYLSATGVAMSSVWATLTGWHTLSFPSAVYLSGIGVSALIAQLSMTRAYKVGDKFT 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml61 , pep VASLSYMTWFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 
1 I ] I I I t I I I I I I I M I I I i I I I I I I I I I I I I i I I I I I I I 1 I I I I I I I I I I I I : I I I I t 
gl61 VASLSYMTWFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPIAFKQRLQALFRQR 
250 260 270 280 290 300 

ml 61. pep X 
I 

gl61 X 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 671>: 

al61 . seq 

1 ATGGATACCG CATi/^AAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTTACCATTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTGCGC TCGGGGCTGC CGCCGTATTG CGTCGGGACA CCTTCCGCAC 

201 GCCCCATTGG AAAAACCACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGGC CACCGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTTTG GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

451 ACGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGTGACA GGTGTGGCGA TGTCATCGGT TTGGGCGACG 

601 CTGACCGGCT GGCACACCCT GTCCTTTCCA TCGGCAGTTT ATCTGTCGTG 

651 CATCGGCGTG TCCGCGCTGA TTGCCCAACT GTCGATGACG CGCGCCTACA 

701 AAGTCGGCGA CAAATTCACG GTTGCCTCGC TTTCCTATAT GACCGTCGTT 

751 TTTTCCGCTC TGTCTGCCGC ATTTTTTCTG GCCGAAGAGC TTTTCTGGCA 

801 GGAAATACTC GGTATGTGCA TCATCATCCT CAGCGGTATT TTGAGCAGCA 
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851 TCCGCCCCAC TGCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 

901 TAA 

This corresponds to the amino acid sequence <SEQ ID 672; ORF 16La>: 

aiei.pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG EL VFWRMLFS 

51 TVALGAAAV L RRDTFRTPHW KNHLNRSMVG TG AMLLLFYA VTHLPLATGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQ AVL LLGFAGWLL LNPSF RSGQE 

151 TAALAGLAGG AMSGWAYLKV RELSLAGEPG W RWFYLSVT GVAMSSVWA T 

201 LTGWHTLS FP SAVYLSCIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTW 

251 F5ALSAAFFL AEELFWQ EIL GMCIIILSGI LSSI RPTAFK QRLQSLFRQR 

301 * 



ml61/al61 99.3% identity in 300 aa overlap 

10 20 30 40 50 60 

ml 61 . pep MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 
M t I I I I I I I I I I I I I I I I I I I I I I I I I ] I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
al61 MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 

10 20 30 40 50 60 

70 80 90 100 110 120 

miei.pep'. RRDXFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 

t I I I I I I I I I I I I I I I I I I I I I I I i I j I I I I I t I I I I I I I I I I I I I I I I I 

al61 RRDTFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPIiATGVTLSYTSSIFLAVFSFLILKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

miei.pep RISVYTQAVLLLGFAGWLLLNPSFRSGQETAALAGLAGGAMSGWAYLKVRELSLAGEPG 

1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

a 1 6 1 RI S VYTQAVLLLGFAGWLLLNPS FRSGQETAALAGLAGGAMSGWAYLKVRELSLAGE PG 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml61 . pep WRWFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 
I I I M I i I I I I 11 I I I I I I I M i I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I i I 
al61 WRWFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 

190 200 210 220 230 240 

250 260 270 280 290 300 

miei.pep VASLSYMTWFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 
i I I M I I I I I M I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I i I I I I I I I 
aiei VASLSYMTWFSALSAAFFLAEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 

250 260 270 280 290 300 



miei . pep 

aiei 



X 

I 

X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 673>: 

gl63 . seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG gctCGACCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTt 

151 ctgGGTTTtc tgctGATACT CTCGGTCAGC GGTTTGGGAA ACATcagGCT 

201 AGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGCC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGACATT ACGGTCGGCG CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

4 01 CCTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 



wo 99/57280 



PCT/US99/09346 



460 



451 CGCTACAAAC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATGGGCTGG ATTGCCGAAA ACAGCTTCGG 

651 CGTGCAGGTC TTGATTATCG CCGCCGTAAT GTCCCTCGCC GTCGTTTCGG 

701 CAATATCCGG CGTGGGGAAG GGCGTGAAGG TGTTGAGCGA GTTGAACCTG 

751 GGCCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG ACCCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTGAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGgc 

951 gcCGTTTGTG GGTTTGTTTA TCGCGCGCAT TTCAAAGGGg cgcaccatCc 

1001 gcgagtttgt CTTCGGGGTT TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTTGC 

1101 GGGGGGAATG CTCGTVAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCG7VAC TGACGAGCAT CGTCAGCCTG 

1201 CTGGTCATTT CCCTGTTTTT TGTAACTTCT GCCGACTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAAAGG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGatgtcTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

1401 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGCT 

1451 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTCAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCGGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACATACC GCATCGCCCG 

1601 CTATGCACGA GTTGCAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGTCGATA AGATGTTTCA TCAGGACGAG CCCGCAATCG AGTTCGTCAT 

1701 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

1751 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1801 CACCAGACAA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC GACATTTTGA 

1901 AAAACTACGA ACGTTATTTG ATGTTGTTGG ATGATGTCGG TCAGGAACTG 

1951 ATGGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 

This corresponds to the amino acid sequence <SEQ ID 674; ORF 163.ng>: 

gl63 .pep 

1 M VILTTLFBV CVLWLVL TV PDQVQMWLDR AKEVIFTEFS WFY VLTFSIF 

51 LGFLLILSVS GLGNIRLGRD EDVPEFGFLS WL AMLFAAGM GVGLMFFGVA 

101 EPLMHYFSDI TVGAPEHRQQ QALLHTVFHW GVHAWSVYGT lALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLGF GASQ 

201 LGAGLQEMGW lAENSFGVQ V LIIAAVMSLA WSAIS GVGK GVKVLSELNL 

251 GLAFLLLFFV lAAD PTVYLL SAFGDNIGNY LGNLVRLSLK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGM LEKMTSSPET LLFKFFNYLP LPE LTSIVSL 

401 LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM 

4 51 RSGGLGNLQS M TLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVRIMSQTQE QDILKFLKHT ASPAMHELQR ELSEEYGLSV 

551 RVDKMFHQDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 

The following partial DNA sequence was identified in A'^ meningitidis <SEQ ID 675>: 

ml 63 . seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG GCTCGATCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTT 

151 CTGGGTTTCC TGCTGATACT CTCGGTCAGC AGTTTGGGAA ACATCAGGCT 

201 CGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGTC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGACATT ACGGCCGGCA CGCCGGMCA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

401 CTTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 

451 CGCTACAAGC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATGGGCTGG ATTGCCGAAA ACAGCTTCAG 



wo 99/57280 




PCT/US99/09346 



651 CGTGCAGGTT TTGATTATCG CCGCCGTCAT GTCCCTCGCC GTCGTTTCGG 

701 CAATATCCGG CGTGGGGAAG GGCGTGAAGG TGTTGAGCGA GTTGAACCTG 

751 GGCCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG GACCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTTAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGGC 

951 GCCGTTTGTG GGTTTGTTTA TCGCGCGCAT TTCAAAGGGG CGCACCATCC 

1001 GCGAGTTTGT CTTCGGGGTT TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTTGC 

1101 GGGGGGAATG CTCGAAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCGAAT TGACGAGCAT CGTCAGCCTG 

1201 CTGGTCATTT CTCTGTTTTT TGTAACTTCT GCCGATTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAT^GG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGATGTCTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

1401 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGCT 

1451 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTTAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCAGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACAGACT GCATCGCCCG 

1601 CTATGCACGA GTTGCAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGTCGATA AAATGTTTCA TCGGGACGAG CCCGCAATCG AGTTCGTCAT 

1701 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

1751 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1801 CATCAGACAA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC GACATTTTGA 

1901 AAAACTACGA ACGTTATTTG ATGTTGTTGG ATGATGTCGG TCAGGAACTG 

1951 ATGGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 

This corresponds to the amino acid sequence <SEQ ID 676; ORF 163>: 

ml 63 .pep 

1 MVILTTLFFV CVLWLVL TV PDQVQMWLDR AKE VIFTEFS WFYVLTFSIF 

51 LGFLLILSVS SLGNIRLGRD EDVPEFGFLS W LAMLFAAGM GVGLMFFGVA 

101 EPLMHYFSDI TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT lALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLGF GASQ 

201 LGAGLQEMGW lAENSFSVQ V LIIAAVMSLA WSAIS GVGK GVKVLSELNL 

251 GLAFLLLFFV LAAG PTVYLL SAFGDNIGNY LGNLVRLSFK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGM LEKMTSSPET LLFKFFNYLP LPE LTSIVSL 
LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM . 

451 RSGGLGNLQS M TLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVQIMSQTQE QDILKFLKQT ASPAMHELQR ELSEEYGLSV 

551 RVDKMFHRDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml63 / gl63 98.6% identity in 660 aa overlap 

10 20 30 40 50 60 

ml 6 3 . pep MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEV I FTEFSWFYVLTFS I FLGFLLI LSVS 

I i 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 

gl63 MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml 63 . pep SLGN I RLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDI TAGTPEHRQQ 

: f I I I M M I I I I I i I I I I I I I I I I I I I I I I M I I M I I I I I M I I I I M I : I : I It I I I 
gl63 GLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVMPLMHYFSDITVGAPEHRQQ 

70 80 90 100 110 120 



130 140 150 160 170 180 
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Itil 63 . pep QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 
I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I i I M I I I I I I I I I I I I I 
gl63 QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 
130 140 150 160 170 180 

190 200 210 220 230 240 

inl63 . pep MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFSVQVLIIAAVMSLAWSAISGVGK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I 
gl63 MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFGVQVLIIAAVMSLAWSAISGVGK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 63 . pep GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 
I I I I I I I i I t I I I t I t I I I I I I I I I I I I ) I I I I [ I I I I I I 1 I I I I I I : I I I I I i I I I 1 1 
gl63 GVKVLSELNLGLAFLLLFFVLAADPTVYLLSAFGDNIGNYLGNLVRLSLKTYAYEREHKP 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 63 , pep WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 

I It I I I I I I I I I I I I I I I I I I I [ I I I I I t I I I I t I I I I I I I I f I I I I I I I I I I 

gl63 WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 
310 320 330 340 350 360 

370 380 390 400 410 420 

nil63.pep WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 
I I I I I I I If I I i I 1 I I I I I I I I I I I I I I I I I I t I I I I I I I i I I I I I I I I I i I I I 1 I I 1 I I 
gl63 WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

370 380 390 400 410 420 

430 440 450 460 470 480' 

ml 63. pep ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 
I I I I I ) I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ] t I { M M I I I I I I I I I 
gl63 ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 63 . pep WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKQTASPAMHELQR 

I I I I M I I I I I I I I I I I I I I I lllihlllllllllllllllilllllllllM 

gl63 WKGLSADKKYFETRVNPTSVFWTGGKWKERLVRIMSQTQEQDILKFIiKHTASPAMHELQR 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 63. pep ELSEEYGLSVRVDKMFHRDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 
M I II I II I I I I i i I i I: i I I I I I I I I I I t I I I I I I I I I [ I I I I M t I I [ I I I I 1 I I I I I 
gl63 ELSEEYGLSVRVDKMFHQDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 63 . pep HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 
M I I I II I I i I 1 I I I I I I I I I I I 1 I I I I i I I I I I I I I I I I I I 1 1 I I I I I I I I j I I I I I I I 
gl63 HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 

610 620 630 640 650 660 



ml 63 . pep X 
I 

gl63 X 



The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 677>: 

al63 . seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG GCTCGATCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTT 

151 CTGGGTTTCC TGCTGATACT CTCGGTCAGC AGTTTGGGAA ACATCAGGCT 
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463 



201 CGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGTC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGACATT ACGGCCGGCA CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

401 CTTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 

451 CGCTACAAGC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATAGGCTGG ATTGCCGAAA ACAGCTTCAG 

651 CGTGCAGGTT TTGATTATCG CCGCCGTCAT GTCCCTCGCC GTCGTTTCGG 

.701 CAATATCCGG CGTGGGGAAG GGTGTGAAGG TGTTGAGCGA GTTGAACCTG 

751 GGTCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG GTCCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTTAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGGC 

951 GCCGTTTGTG GGTTTGTTTA TCGCGCGCAT TTCAAAGGGG CGCACCATCC 

1001 GCGAGTTTGT CTTCGGGGTT TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTTGC 

1101 GGGGGGAGTG CTCGAAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCGAAT TGACGAGCAT CGTCAGCCTG 

1201 CTGGTCATTT CTCTGTTTTT TGTAACTTCT GCCGATTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAAAGG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGATGTCTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

1401 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGAT 

1451 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTTAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCAGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACATACC GCATCGCCCG 

1601 CTATGCACGA GTTACAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGTCGATA AGATGTTTCA TCAGGACGAG CCCGCAATCG AGTTCGTCAT 

1701 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

1751 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1801 CATCAGACAA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC GACATTTTGA 

1901 AAAACTACGA ACGTTATTTG ATGTTGTTGG ATGATGTCGG TCAGGAACTG 

1951 ATGGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 



This corresponds to the amino acid sequence <SEQ ID 678; ORF 163.a>: 

al63 .pep 

1 MVILTTLFFV CVLWLVL TV PDQVQMWLDR AKEVIFTEFS WFYVLTFSIF 

51 LGFLLILSVS SLGNIRLGRD EDVPEFGFLS W LAMLFAAGM GVGLMFFGV A 

101 EPLMHYFSDI TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT lALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAI DI M7U,LATFFGI ITTLGF GASQ 

201 LGAGLQEIGW lAENSFSVQ V LIIAAVMSLA WSAIS GVGK GVKVLSELNL 

251 GLAFLLLFFV LAAG PTVYLL SAFGDNIGNY LGNLVRLSFK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGV LEKMTSSPET LLFKFFNYLP LPE LTSIVSL 

401 LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM 

451 RSGGLGNLQS MTLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVQIMSQTQE QDILKFLKHT ASPMHELQR ELSEEYGLSV 

551 RVDKMFHQDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 



ml63/al63 99.4% identity in 660 aa overlap 

10 20 30 40 50 60 

ml 63 . pep MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 

III I M I I t I I I i I I I I II I I I I I I f I I I I I I I I I I I I I I 

al63 MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 
10 20 30 40 50 60 



ml 63 .pep 



■^O 80 90 100 110 120 

SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 
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1 1 1 1 i 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

al63 SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 63 . pep QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I It I I I I I t I I I I I I I I I I I I I I I I I I I I I 
al63 QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml63.pep MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFSVQVLIIAAVMSLAWSAISGVGK 
I I I I I I I 1 I I I I I I I I I I I I I I I i I n : I M I I I ! I I I I I I i I I I I I I I I 1 I I I I I I I I I 
al63 MALLATFFGIITTLGFGASQLGAGLQEIGWIAENSFSVQVLIIAAVMSLAWSAISGVGK 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml 63 . pep GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 
i I I i I I I I I t M I I I I I t I I I I I I I I I I I I I I I I I t { I I I I I M I I I I I I t I I I I I I I I I 
al63 GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml63 . pep WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 
I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I t I I I I I I I I I I I t I I I I I I I I I I I I I 
al63 WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml63.pep WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 
I I I i I I I I I : I I I I I I I I M I t I i I 11 I I I I I I I I I I [ I I I I I I I I I I I I I I I I I t t I I I 
al63 WLNDGVAGGVLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 63 . pep ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I M I I I I t I I I I I I I t I I I I I I I [ 
al63 ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml63 . pep WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKQTASPAMHELQR 
I I I I I I I I I I I I I I I i i i i I I I I I I I I I I I i I I I I I I I I I I I t [ I i I I : I 1 I I i I I i I I I 
al63 WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKHTASPAMHELQR 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml63.pep ELSEEYGLSVRVDKMFHRDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 
I I I i I I I I I I M I I I I I: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al63 ELSEEYGLSVRVDKMFHQDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 63 . pep HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I t i j I I 1 I I I I I I I I I I 
al63 HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 

610 620 630 640 650 660 



ml 63 .pep 
al63 



X 
I 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 679>: 

gl64.8eq (partial) 

1 ..ATGAACACAT TTTTGAAAAA CAGCGAATAC GCGTATATCC TGAACGACTG 

' 51 CAAGGCGCGC TTCCTGTTCG CCTCGGCCGG CCTGTCAAAA GAATTGGCGG 

101 GCCTGAAGGC GCAAACGCCC GTCGAAAAAA TCATTTGGAC GGACAAAAGC 

151 CGGCCGGCCG GCGAAACGGC GGAAGGCGAT GCCTTTTTTG AAAACGTGCG 

201 CCGCTTCCCC GAAAAAGCCG ACTTGGGCCG CCAACCCCGG ATAAATGATT 

251 TGGCACACAT CATCTACACC TCCGGCACGA CGGGGCATCC CAAAGGCGCG 

301 CTAATCAGTT ACGCCAACCT GTTCGCCAAC CTGAACGGCA TCGAACGCAT 

351 CTTtaaAATT TCCAAACGCG ACCGCTTTAT CGTTTTCctg ccgatgTTCC 

401 ACAGCTTCAC GCTGACGGCT ATGGTGCTGC TGCCGATTTA TATGGCGTGT 

451 TCGATTATTT TGGTCAAAtc cgttttCCCc ttttCCaacG TTTTGAAACA 

501 GGCCCTGCTC AAACGCGCAA CCGTGTTTTT GGGCGTACCC GCGATTTACA 

551 CCGCGATGAG CAAGGCAAAA ATCCCTTGGT ATTTCAGATG GTTCAACCGC 

601 ATCCGCCTGT TTATCAGCGG CGGCGCGCCT TTGGCGGAAC AAACCATCCT 

651 CGATTTTAAA GCCAAGTTCC CCCGCGCCAA ATTGCTGGAA GGCTACGGAC 

701 TGAGCGAAGC CTCGCCCGTC GTCGCCGTCA ATACGCCCGA ACGGCAAAAA 

751 GCCCGCAGCG TCGGCATCCC CCTGCCCGGT TTGGAAGCCA AAGCCGTCGA 

801 TGAAGAATTG GTCGAAGTGC CGCGCGGCGA AGTGGGCGAA CTGATCGTCA 

851 GGGGCGGTTC GGTGATGCGG GGCTACCTCA ATATGCCTGC CGCCACCGAT 

901 GAAACCATCG TCAACGGCTG GTTGAAAACG GGCGATTTCG TTACCATAGA 

951 CGAGGACGGC TTTATCTTTA TCGTCGACCG CAAAAAAGAT TTGATTATTT 

1001 CCAAAGGTCA AAACGTCTAT CCGCGCGAGA TCGAAGAAGA AATCCACAAA 

1051 CTCGATGCCG TCGAAGCCGC CGCCGTCATC GGCGTGAAAG ACCGTTATGC 

1101 CGACGAGGAA ATCGTCGCCT TCGTCCAATT GAAGGAAGGT ATGGATTTGG 

1151 GCGAGGACGA aatccgccgc caccTGCGTA CCGTGCTGGC AAATTTCAAA 

1201 ATCCCCAAAC AGATCCACTT TAAAGACGGG CTGCCGCGCA ACGCTACGGG 

1251 CAAAGTATTG AAACGGGTGC TGAAGGAGCA GTTTGAAGGA AACAAATGA 

This corresponds to the amino acid sequence <SEQ ID 680; ORF 164.ng>: 

gl64.pep (partial) 

1 . .MNTFLKNSEY AYILNDCKAR FLFASAGLSK ELAGLKAQTP VEKIIWTDKS 

51 RPAGETAEGD AFFENVRRFP EKPDLGRQPR INDLAHIIYT SGTTGHPKGA 

101 LISYANLFAN LNGIERIFKI SKRDRFIVFL PMFHSFTL TA MVLLPIYMAC 

151 SIILVK SVFP FSNVLKQALL KRATVFLGVP AIYTAMSKAK IPWYFRWFNR 

201 IRLFISGGAP LAEQTILDFK AKFPRAKLLE GYGLSEASPV VAVNTPERQK 

251 TVRSVGIPLPG LEAKAVDEEL VEVPRGEVGE LIVRGGSVMR GYLNMPAATD 

301 ETIVNGWLKT GDFVTIDEDG FIFIVDRKKD LIISKGQNVY PREIEEEIHK 

351 LDAVEAAAVI GVKDRYADEE IVAFVQLKEG MDLGEDEIRR HLRTVLANFK 

401 IPKQIHFKDG LPRNATGKVL KRVLKEQFEG NK* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 681>: 

ml64 . seq 

1 ATGAACCGGA CTTATGCCAA TTTCTACGAA ATGCTCGCCG CCGCCTGCCG 

51 CAAAAACGGA AACGGCACGG CAGTGTTCGA CGGCAAGGAA AAAACCGCCT 

101 ACCGCGCGCT CAAGCAGGAG GCCGAAGCCG TCGCGGCGTA TCTGCAAAAT 

151 ATCGGCGTGA AGTTCGGCGA CACGGTCGCG CTGGCGGTTT CCAATTCCAC 

201 AGAATTTATT ACCGCCTATT TCGCCATCTC CGCCATCGGC GCGGTCGCCG 

251 TACCGATGAA CACATTTTTG AAAAACAGCG AATACGCGTA TATCCTGAAC 

301 GACTGCAAGG CGCGCTTCCT GTTCGCCTCG GCCGGCCTGT CAAAAGAATT 

351 GGCGGGCTTG AAGGCGCAAA CGCCCGTCGA AAAAATCATT TGGACGGACA 

401 AAAGCCGTCC GACCGGCGAA ACGGCGGAAG GCGATGCCTT TTTTGAAGAC 

451 GTGCGCCGCT TCCCCGAAAA ACCCGACTTG GGCCGCCAAC CCCGGATAAA 

501 TGATTTGGCA CACATCATCT ACACCTCCGG CACGACGGGG CATCCCAAAG 

551 GCGCGCTAAT CAGTTACGCC AACCTGTTCG CCAACCTGAA CGGCATCGAA 

601 CGCATCTTTA AAATTTCCAA GCGCGACCGC TTTATCGTTT TCCTGCCGAT 

651 GTTCCACAGC TTCACGCTGA CGGCTATGGT GCTGCTGCCG ATTTATATGG 

701 CGTGTTCGAT TATTTTGGTC AAATCCGTTT TTCCGTTTTC CAACGTTTTG 

751 AAACAGACAC TGCTCAAACG CGCGACCGTG TTTTTGGGCG TACCCGCGAT 

801 TTACACCGCG ATGAGCAAGG CGAAMTCCC TTGGTATTTC AGATGGTTCA 

851 ACCGCATTCG CCTGTTTATC AGCGGCGGCG CGCCTTTGGC GGAACAAACC 

901 ATCCTCGATT TCAAAGCCAA GTTCCCCCGC GCCAAATTGC TGGT^GGCTA 
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951 CGGACTGAGC GAAGCCTCTC CCGTCGTCGC CGTCAATACG CCCGAGAGGC 

1001 AAAAAGCCCG CAGCGTCGGC ATCCCCCTGC CCGGTTTGGA AGCCAAAGCC 

1051 GTCGATGAAG AATTGGTCGA AGTGCCGCGC GGCGAAGTGG GCGAACTGAT 

1101 CGTCAGGGGC GGTTCGGTGA TGCGGGGCTA CCTCAATATG CCTGCCGCCA 

1151 CCGATGAAAC CATCGTCAAC GGCTGGTTGA AAACGGGCGA TTTCGTTACC 

1201 ATAGACGAAG ACGGCTTTAT CTTTATCGTC GACCGCAAAA AAGATTTGAT 

1251 TATTTCCAAA GGTCAAAATG TCTATCCGCG CGAGATTGAA GAAGAAATCT 

1301 ACAAACTCGA TGCCGTCGAA GCCGCCGCCG TCATCGGCGT GAAAGACCGT 

1351 TATGCCGACG AGGAAATCGT CGCCTTCGTC CAATTGAAGG AAGGTATGGA 

14 01 TTTGGGCGAG AACGA/VATCC GCCGCCACCT GCGTACCGTG CTGGCAAATT 

1451 TCAAAATCCC CAAACAAATC CACTTTAAAG ACGGGCTGCC GCGCAACGCT 

1501 ACGGGCAAGG TATTGAAACG GGTGTTGAAG GAGCAGTTTG ACGGAAACAA 

1551 ATGA 

This corresponds to the amino acid sequence <SEQ ID 682; ORF 164>: 

ml 64 .pep 

1 MNRTYANFYE MLAAACRKNG NGTAVFDGKE KTAYRALKQE AEAVAAYLQN 

51 IGVKFGDTVA LAVSNSTE FI TAYFAISAIG AVAVP MNTFL KNSEYAYILN 

101 DCKARFLFAS AGLSKELAGL KAQTPVEKII WTDKSRPTGE TAEGDAFFED 

151 VRRFPEKPDL GRQPRINDLA HIIYTSGTTG HPKGALISYA NLFANLNGIE 

201 RIFKISKRDR FIVFLPMFHS FTL TAMVLLP lYMACSIILV KSVFPFSNVL 

251 KQTLLKRATV FLGVPAIYTA MSKAKIPWYF RWFNRIRLFI SGGAPLAEQT 

301 ILDFKAKFPR AKLLEGYGLS EASPWAVNT PERQKARSVG IPLPGLEAKA 

351 VDEELVEVPR GEVGELIVRG GSVMRGYLNM PAATDETIVN GWLKTGDFVT 

401 IDEDGFIFIV DRKKDLIISK GQNVYPREIE EEIYKLDAVE AAAVIGVKDR 

451 YADEEIVAFV QLKEGMDLGE NEIRRHLRTV LANFKIPKQI HFKDGLPRNA 

501 TGKVLKRVLK EQFDGNK* 



Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from A^. gonorrhoeae ml64 / gl64 98.6% identity in 432 aa 

overlap 



60 70 80 90 100 110 

ml64 . pep GDTVALAVSNSTEFITAYFAISAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSK 

M I I I I I I I I I I I M I M I t I I t I I M I I I 
gl64 MNTFLKNSEYAYILNDCK7VRFLFASAGLSK 

10 20 30 

120 130 140 150 160 170 

ml64 . pep ELAGLKAQTPVEKIIWTDKSRPTGETAEGDAFFEDVRRFPEKPDLGRQPRINDLAHIIYT 
lllllllllllllllllllll|:Mfllllllll:|||lllllllllllllllillMII 
gl64 ELAGLKAQTPVEKIIWTDKSRPAGETAEGDAFFENVRRFPEKPDLGRQPRINDLAHIIYT 
40 50 60 70 80 90 



180 190 200 210 220 230 

ml64.pep SGTTGHPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMAC 
i f I I I i I I I I I I I i I I I I 1 I I I t I I I I ) I I I I I I I I I I I I t i I I I I I I I I I I I I I I I I I I 
gl64 SGTTGHPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMAC 
100 110 120 130 140 150 



240 250 260 270 280 290 

ml64 .pep SIILVKSVFPFSNVLKQTLLKRATVFLGVPAIYTAMSKAKIPWYFRWFNRIRLFISGGAP 
I i I I ) t I I I I I I I I I t I : I I I t I I I I I I I I I t I I I I I I t I I I t I I I I I I I I I I I I I I I I I 
gl64 SIILVKSVFPFSNVLKQALLKRATVFLGVPAIYTAMSKAKIPWYFRWFNRIRLFISGGAP 
160 170 180 190 200 210 



300 310 320 330 340 350 

inl64 .pep LAEQT I LDFK7VKFPR7UCLLEGYGLSEASPWAVNT PERQKARSVG I PLPGLEAKAVDEEL 

I M M I M I I I M M I I I 1 I I I I I I i I I I M I I I I I I I I I I I I I I I I I i I I I I I I M I I I 
gl64 LAEQTILDFKAKFPRAKLLEGYGLSEASPWAVNTPERQKARSVGIPLPGLEAKAVDEEL 
220 230 240 250 260 270 

360 370 380 390 400 410 
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ml 64 . pep VEVPRGEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 
I I M I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
gl64 VEVPRGEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 
280 290 300 310 320 330 

420 430 440 450 460 470 

ral64 .pep LIISKGQNVYPREIEEEIYKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRR 
I M M M I I I I I I I I I I I : I I I t I I I I I I I I i I I I I I I I t I I I I I I I I M I I I I I : I I I I 
gl64 LIISKGQNVYPREIEEEIHKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGEDEIRR 
340 350 360 370 380 390 

480 490 500 510 

inl64 . pep HLRTVLANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFDGNKX 
IMMMMIItlllllllillllllllMlllllllhllll 
g 1 6 4 HLRT VLAN FKI PKQ IHFKDGLPRNATGKVLKRVLKEQFEGNKX 

400 410 420 430 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 683>: 

al64 . seq 

1 ATGAACCGGA CTTATGCCAA TTTCTACGAA ATGCTGACCG CCGCCTGCCG 

51 CAAAAACGGC AACGGCACGG CAGTGTTCGA CGGCAAGGAA AAAACCGCCT 

101 ACCGCGCGCT CAAGCAGGAA GCCGAAGCCG TTGCGGCGTA TCTGC/WW^T 

151 ATCGGCGTGA AGTTCGGCGA CACGGTCGCG CTGGCGGTTT CCAATTCCAC 

201 GGAATTTATT ACCGCCTATT TCGCCGTATC CGCCATCGGC GCGGTTGCCG 

251 TACCGATGAA CACATTTTTG AAAAACAGCG AATACGCGTA TATCCTGAAC 

301 GACTGCAAGG CGCGCTTCCT GTTCGCCTCG GCCGGCCTGT CAAAAGAATT 

351 GGCGGGCTTG AAGGCGCAAA CGCCCGTCGA AAAAATCATT TGGACGGGCC 

401 AT^GCCGTCC GGACGGCGAA ATGGCGGAAG GCGATGCCTT TTTTGAAGAC 

451 GTGCGCCGCT TCCCCGAAAA ACCCGACTTG GGCCGCCAAC CCCGGATAAA 

501 TGATTTGGCA CACATCATCT ACACCTCCGG CACGACGGGG CATCCCAAAG 

551 GTGCGCTAAT CAGCTACGCC AACCTGTTCG CCAACCTGAA CGGCATCGAA 

601 CGCATCTTTA AAATCTCCAA GCGCGACCGC TTTATCGTTT TCCTGCCGAT 

651 GTTCCACAGC TTCACGCTGA CGGCTATGGT GCTGCTGCCG ATTTATATGG 

701 CGTGTTCGAT TATTTTGGTC TW^TCCGTTT TCCCCTTTTC CAACGTTTTG 

751 AAACAGGCAC TGCTCAAACG CGCGACCGTG TTTTTGGGCG TGCCCGCGAT 

801 TTACACCGCG ATGAGCAAGA CGAAAATCCC TTGGTATTTC AGATGGTTCA 

851 ACCGCATCCG CCTGTTTATC AGCGGCGGAG CACCTTTGGC GGAACAAACC 

901 ATCCTCGATT TCAAAGCCAA GTTCCCCCGC GCCAAATTGC TGGAAGGCTA 

951 CGGACTGAGC GAAGCCTCGC CCGTCGTCGC CGTCAATACG CCCGAGAGGC 

1001 AAAAAGCCCG CAGCGTCGGC ATCCCCCTGC CCGGTTTGGA AGTCAAAGCC 

1051 GTCGATGAAG AATTGGTCGA AGTGCCGCGC GGCGAAGTGG GCGAACTGAT 

1101 CGTCAGGGGC GGTTCGGTGA TGCGGGGCTA CCTCAATATG CCTGCCGCCA 

1151 CCGATGAAAC CATCGTCAAC GGCTGGTTGA AAACGGGCGA TTTCGTTACC 

1201 ATAGACGAAG ACGGCTTTAT CTTTATCGTC GACCGCAAAA AAGATTTGAT 

1251 TATTTCCAAA GGTCAAAATG TCTATCCGCG CGAAATCGAA GAAGAAATCT 

1301 ACAAACTCGA TGCCGTCGAA GCCGCCGCCG TCATCGGCGT GAAAGACCGT 

1351 TATGCCGACG AGGAAATCGT CGCCTTCGTC CAATTGAAGG AAGGTATGGA 

14 01 TTTGGGCGAG AACGAAATCC GCCGCCACCT GCGTACCGTG CTGGCAAATT 

1451 TCAAAATCCC CAAACAAATC CACTTTAAAG ACGGGCTGCC GCGCAACGCT 

1501 ACGGGCAAGG TATTGAAACG GGTGTTGAAG GAGCAGTTTG ACGGAAACAA 

1551 ATGA 

This corresponds to the amino acid sequence <SEQ ED 684; ORF 164.a>: 

al64 .pep 

1 MNRTYANFYE MLTAACRKNG NGTAVFDGKE KTAYRALKQE AEAVAAYLQN 

51 IGVKFGDTVA LAVSNSTE FI TAYFAVSAIG AVAVP MNTFL KNSEYAYILN 

101 DCKARFLFAS AGLSKELAGL KAQTPVEKII WTGQSRPDGE MAEGDAFFED 

151 VRRFPEKPDL GRQPRINDLA HIIYTSGTTG HPKGALISYA NLFANLNGIE 

201 RIFKISKRDR FIVFLPMFHS FTL TAMVLLP lYMACSIILV KSVFPFSNVL 

251 KQALLKRATV FLGVPAIYTA MSKTKIPWYF RWFNRIRLFI SGGAPLAEQT 

301 ILDFKAKFPR AKLLEGYGLS EASPWAVNT PERQKARSVG IPLPGLEVKA 

351 VDEELVEVPR GEVGELIVRG GSVMRGYLNM PAATDETIVN GWLKTGDFVT 

401 IDEDGFIFIV DRKKDLIISK GQNVYPREIE EEIYKLDAVE AAAVIGVKDR 

451 YADEEIVAFV QLKEGMDLGE NEIRRHLRTV LANFKIPKQI HFKDGLPRNA 
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501 TGKVLKRVLK EQFDGNK* 
ml64/al64 98.3% identity in 517 aa overlap 

10 20 30 40 50 60 

ml 64 . pep MNRTYANFYEMLAAACRKNGNGTAVFDGKEKTAYRALKQEAEAVAAYLQNIGVKFGDTVA 
t I I I I I I t I I I I : I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
al64 MNRTYANFYEMLTAACRKNGNGTAVFDGKEKTAYRALKQEAEAVAAYLQNIGVKFGDTVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 64 . pep LAVSNSTEFITAYFAISAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSKELAGL 
I I I I I I I I i I I M I I : I I M I I I I i I I I I I I I I t I I I I I I I I i I I I I I I I I t I I I I I I I I 
al64 LAVSNSTEFITAYFAVSAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSKELAGL 

70 80. 90 100 110 120 

130 140 150 160 170 180 

ml64 . pep KAQTPVEKIIWTDKSRPTGETAEGDAFFEDVRRFPEKPDLGRQPRINDLAHIIYTSGTTG 
I I I I I I I I M I I : I I I II I I I I I I I I I I I I I I I I I i I I I I I I M I I I I I I I t I I I I I 
al64 KAQTPVEKIIWTGQSRPDGEMAEGDAFFEDVRRFPEKPDLGRQPRINDIiAHIIYTSGTTG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 64 .pep HPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMACSIILV 

M I I t I I I M I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 

al64 HPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMACSIILV 

190 200 210 220 230 . 240 

250 260 270 280 290 300 

ml 64 . pep KSVFPFSNVLKQTLLKRATVFLGVPAIYTAMSKAKIPWYFRWFNRIRLFISGGAPLAEQT 

IIMMMMII:!! I I I I I i I I I I I I I : I I I I I 1 I I I I I I I I 

al64 KSVFPFSNVLKQALLKRATVFLGVPAIYTAMSKTKIPWYFRWFNRIRLFISGGAPLAEQT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 64 . pep ILDFKAKFPRAKLLEGYGLSEASPWAVNTPERQKARSVGI PLPGLEAKAVDEELVEVPR 

I I I I I I I I I I I I I I I t I I I I I I i I I I I I I I I I I I I I I I I I I I { I I I I : I I I I I I I I I I M 
al64 ILDFKAKFPRAKLLEGYGLSEASPWAVNTPERQKARSVGIPLPGLEVKAVDEELVEVPR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml64 . pep GEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKDLIISK 
I t I I I I I I I I M I I I I I I I I I I I I M ! I I I M I 1 I I I I I I I I I I I I I I I I I I I I I i I j ] i 
al64 GEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKDLIISK 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 64 . pep GQNVYPREIEEEI YKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRRHLRTV 

I I I i I I I I I I I i I i I I I I I I I I I I I I I I I I I I I 1 I I I i I I I i I I I I I I I I I I I I I i I I i I 
al64 GQNVYPREIEEEIYKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRRHLRTV 

430 440 450 460 470 480 

490 500 510 

ml 64 . pep LANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFDGNKX 
I I I I t I I I i I I I I I I I I I I I I I I I I I I i I M I I I I I I I 
al64 LANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFDGNKX 

490 500 510 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 685>: 

gl65 . seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 
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51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAagat gXGGCGTTGG AATCGTCAAA cGCGTGGAAC 

151 AACGcCGgca CGGGGCATTC CGcGCTGTGc GAATTGAACT AtgcgccGCT 

201 GGGtgcggac ggcgtcatCA ATCCGGCGCg cgCCCTGAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGcga cgctgGTCGC GGAAGGCAAG 

301 TTGGAagaCA ATTCCTTCAT CAATGCcgtg ccgcatatGT Ctttggtgat 

351 gAacgaagac cactgCCgtt acCTGCAAAA ACGCTATGAT GTGTTTAAAA 

401 CGCAGAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CtccgCTGAT TATGCGCGGC CGGgacgaaA ACCAACCCGT 

501 CGCCGCCAAC TATTCCGCCG Aaggcacgga tgtcgATTTC GGACGGCTGA 

551 CGCGCCAGAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 MCCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCAGACTG GCAGCTCACC CTCCGCACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CACTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTACCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTAGACGG CAAACGACAC CTTATGTTCG GTCCTTACGC 

951 AGGTTTCCGT TCCAACTTCC TCAAGCAAGG CTCGTTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CcTGCTGGgC gAaTTGCgtt aa 



This corresponds to the amino acid sequence <SEQ ID 686; ORF 165.ng>: 

gl65 .pep 

1 MAEAT DWLV GGGIMSATLG VLL KEIiEPSW EITLIERLED VALESSNAWN 



51 NAGTGHSALC ELNYAPLGAD GVINPARALN lAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCRYLQKRYD VFKTQKLFEN MEFSTDRNKI 

151 SDWAPLIMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDWQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GLPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSFM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELR* 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 687>: 

ml65.8eq (partial) 



1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGCTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGgTACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTCCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCGCTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGTGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAA. . . 



This corresponds to the amino acid sequence <SEQ ID 688; ORF 165>: 

ml65.pep (partial) 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALESSNAWN 



51 NAGTGHSALC ELNYAPLGAN GIIDPARALN lAEQFHVSRQ FWATLVAEGK 
101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 
151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 
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201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLCAGWA 

351 NMPLTK. . . 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml65 / gl65 97.2% identity in 356 aa overlap 

10 20 30 40 50 60 

ml 65 . pep MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I I I M I I I I I I I I i 1 i I I I I I I I I I I I I I I I I I I [ I I I I I I I I I I I I I I I I I i I I I I I I I 
gl65 MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml65 . pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
I I I I I I I I I : I : I : I I I I I I 1 I I I I I i I I I I I I I I I 1 M I I I I t I I I I I I I I i I I I I I I I 
gl65 ELNYAPLGADGVINPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 65 . pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 
tl llllill:ltlllllllllllllllllllllll:|llllllllllllll}ll|jMI 
gl65 HCRYLQKRYDVFKTQKLFENMEFSTDRNKISDWAPLIMRGRDENQPVAANYSAEGTDVDF 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 65 . pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
I t M I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I i I I I I I I I M I t I I I I I I I I I I I I I 
gl65 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDWQLTLRTRFLFLGA 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 65 . pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
I i I I I I I i I I I I I I I I I I I I I: I I I I I I I i I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
gl65 GGGALTLLQKSGIPEGKGYGGLPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 

310 320 330 340 350 

ml 65 . pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTK 
I I I I I I I I I I I ) I I I I i I I I I I I I I I I I : I I I I I I t I I I I I I I I I M I I I I I M I 
gl65 DTRNVDGKRHLMFGPYAGFRSNFLKQGSFMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 

gl65 ELRX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 689>: 

al65 . seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGTTGGTCGC GGAAGGC/^G 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG . GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 
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601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 



AACCGCCACG 
CAAAACCGCC 
GCTTCCTCTT 
TCCGGCATCC 
GTTCTTCCGC 
TGTACGGGCA 
GACACACGCA 
AGGCTTCCGT 
TGTCCATCCA 
AATATGCCGC 
AGAACGCTTC 
ACTGGGAACT 
TCCGAAAAAG 
CGACGGCTCG 
CCGTGCCGCT 
CCGTCTTGGG 
GTTGAACGAA 
AAGTGTTGGA 



TCGAAGACAT 
GATACCCGCA 
CCTCGGCGCG 
CCGAAGGCAA 
AACAGCAACC 
GGCTTCCGTC 
ACGTGGACGG 
TCCAACTTCC 
TATGGACAAC 
TGACCAAATA 
GCCTCCCTGC 
CATCACCGCA 
GCGGCGTGTT 
CTCGCCGCAT 
GATGATCCGG 
AAGGCCGTCT 
AACCCCGAAA 
TATTTAA 



CAAACGCGAA 
ACCCCGACGG 
GGCGGCGGCG 
AGGCTACGGT 
CCGAAACCGC 
GGCGCGCCGC 
CAAACGCCAC 
TCAAGCAAGG 
CTCTATCCTA 
CCTGCTGGGC 
TGGAATACTA 
GGGCAACGCG 
GCAGTTTGGT 
TGCTGGGCGC 
CTGATGCACC 
GAAAGAGCTG 
GGGCGGATGA 



TCCGACGGCG 
GCAGCTCACC 
CGCTGACCCT 
GGCTTTCCCG 
CGAACAACAC 
CGATGTCCGT 
CTTATGTTCG 
CTCACTTATG 
TGCTGGGCGC 
GAATTGCGTA 
CCCCGAGGCA 
TTCAAATCAT 
ACGGAGATTG 
GTCGCCGGGC 
AATGCTTCCC 
GTACCGGGTT 
AATTATCGCC 



CGTGGGTGCT 
CTCCGTACCC 
GCTGCAAAAA 
TGTCCGGCCT 
AACGCCAAAG 
CCCGCACCTC 
GCCCTTACGC 
GATTTGCCGC 
CGGCTGGGCG 
AAACCAAAGA 
AACCCCGACG 
TAAAAAAGAC 
TCGCACACGC 
GCATCGACCG 
CGAACGCACC 
ACGGCATCAA 
TATACCGCGA 



This corresponds to the amino acid sequence <SEQ ID 690; ORF 165. a>: 

al65.pGp 

MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED 

lAEQFHVSRQ 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 



VALES SNAWN 

NAGTGHSALC ELNYAPLGAN GIIDPARALN lAEQFHVSRQ FWATLVAEGK 
LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 
SDWAPLMMRG RDENQPVA?UJ YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 
NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 
SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 
DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLRAGWA 
NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 
SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERT 
PSWEGRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI* 



ml65/al65 



99.7% identity in 356 aa overlap 



10 20 30 40 50 60 

ml 65 . pep MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I i I i I I I I I I I I I I I I i I I I I I I I I I I I I 1 I I I i I I I I I I I I I I I I I I I i I I I I ) I I I I I 
al65 MT^ATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml65.pep ELNYAPLGANG II DPARAL^3 1 AEQFHVSRQFWATLVAEGKLEDNSFINAV PHMSLVMNED 

I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I 
al65 ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 65 . pep HCSYLQKRYDAFKTQKLFENMEFSTDRNK I SDWAPLMMRGRDENQPVAAN YSAEGTDVDF 

M I I I I I I I I I I I I I I I I I I I It I I I I I i I I I i I I I I I I I 1 I I I I I M I I I I I I 

al65 HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 65 . pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLT LRTRFLFLGA 

M I I I I I I M M i I I I I I I I I I I I I I I I i I I I t I i I I i i I I I I I I I I I I I I I I I I ! 1 I I I 
al65 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 65. pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
H N I I M M I M I I I I I I I I I t I I I I i I I I I I I 1 I I M t I I I I I M i I I I I I I I I I I j I 
al65 GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 
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310 320 330 340 350 

ml 65 . pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTK 
I I I i I I I I I I I I I I I t I I I I I I I I I t I I I I I i I I I I I I I I i I I I I I I I I I I I I I I 
al6.5 DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 

al65 ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 
370 380 390 400 410 420 

The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 691>: 

gl65-1.8aq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAagat gTGGCGTTGG AATCGTCAAA cGCGTGGAAC 

151 AACGcCGgca CGGGGCATTC CGcGCTGTGc GAATTGAACT AtgcgccGCT 

201 GGGtgcggac ggcgtcatCA ATCCGGCGCg cgCCCTGAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGcga cgctggTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAACGAAGAC CACTGCCGTT ACCTGCAAAA ACGCTATGAT GTGTTTAAAA 

401 CGCAGAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CtccgCTGAT TATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TATTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAGAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCAGACTG GCAGCTCACC CTCCGCACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CACTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTACCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTAGACGG CAAACGACAC CTTATGTTCG GTCCTTACGC 

951 AGGTTTCCGT TCCAACTTCC TCAAGCAAGG CTCGTTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTtt gCCTCCCTGC TGgaatacta cccGaggcag acccGACGAc 

1151 tggtactcat cacgcaggnc acGCGTcata tcattanata tgactCgaaa 

1201 ctgcgcgtgc tgcagttgta cgagattgtg ccaCGCGacg ctcgctcgcg 

1251 cattctggag cgtcgcggcg catcacgctn tgcgctgata tccgctgatg 

1301 acactgctcc gaCCGcgccc gtcttggaaa gtgtctga 

This corresponds to the amino acid sequence <SEQ ID 692; ORF 165-l.ng>: 

gl 65-1. pep 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAD GVINPARALN lAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCRYLQKRYD VFKTQKLFEN MEFSTDRNKI 

151 SDWAPLIMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDWQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GLPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSFM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPRQ TRRLVLITQX TRHIIXYDSK 

401 LRVLQLYEIV PRDARSRILE RRGASRXALI SADDTAPSAP VLESV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 693>: 

mieS-l.saq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGCTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 G7UVTGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGTACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTCCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 
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851 TGTACGGGCA GGCTTCCGTC 

901 GACACACGCA ACGTGGACGG 

951 AGGCTTCCGT TCCAACTTCC 

1001 TGTCCATCCA TATGGACAAC 

1051 AATATGCCGC TGACCAAATA 

1101 AGAACGCTTC GCCTCCCTGC 

1151 ACTGGGAACT CATCACCGCA 

1201 TCCGAAAAAG GCGGCGTGCT 

1251 CGACGGCTCA CTCGCCGCAT 

1301 CTGTGCCGCT GATGATCCGG 

1351 CCGTCTTGGG AAGACCGTCT 

1401 GTTGAACGAA AACCCTGAAA 

1451 AAGTATTGGA TATTTAA 



GGCGCGCCGC CGATGTCCGT CCCGCACCTC 
CAAACGCCAC CTTATGTTCG GCCCTTACGC 
TCAAGCAAGG CTCGCTTATG GATTTGCCGC 
CTCTATCCTA TGCTGTGCGC CGGCTGGGCG 
CCTGCTGGGC GAATTGCGTA AAACCAAAGA 
TGGAATACTA CCCCGAGGCA AACCCCGACG 
GGGCAACGCG TCCAAATCAT TAAAAAAGAC 
CCAGTTTGGT ACGGAGATTG TCGCCCACGC 
TGCTGGGCGC GTCGCCGGGC GCATCGACCG 
CTGATGCACC AATGCTTCCC CGAGCGCGCC 
GAAAGAGCTG GTACCGGGTT ACGGCATCAA 
GGGCGGATGA AATTATCGCC TATACCGCGA 



This corresponds to the amino acid sequence <SEQ ID 694; ORF 165-1>: 

ml 65-1. pep 

1 MAEAT DVVLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN lAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLCAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

401 SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERA 

451 PSWEDRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI* 



ml65-l/gl65-l 89.7% identity in 428 aa overlap 



10 20 30 40 50 60 

ml 65-1 . pep MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

iitiiiiiiiiiiiititiiiitiiiiiiiiiiiiiiiiiiiitiitiiitiiititiii 

gl65-l MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml 65-1. pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
lllllllll:|:t:l)lllllltllllllllltlllilllllllllMIIIIIIII)lll 
gl65-l ELNYAPLGADGVINPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 65- 1 . pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAAN YSAEGTDVDF 

II lllllll:IIIIIIIIIIIIIIIIIIMIIIII:ltlllltll Illllilll 

gl65-l HCRYLQKRYDVFKTQKLFENMEFSTDRNKISDWAPLIMRG RDENQPVAAN YSAEGTDVDF 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml65-l.pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 

IIMMIIIIIIMIIIIllllllllllilllllllllllllllll itiintiiiiii 

gl65-l GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDWQLTLRTRFLFLGA 
190 200 210 220 230 240 



, 250 260 270 280 290 300 

ml 65-1. pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
I I I I M I 11 I i II I I I I I I II : I II i I I j I I I II i I I I I I I n I I I I I I I t I I I I I I I I i 
gl65-l GGGALTLLQKSGIPEGKGYGGLPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml 65-1. pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTKYLLG 
IIMIIMIIIIIIIIII)lllllllll:|ltll)llllllllll llllllllllllll 
gl65-l DTRNVDGKRHLMFGPYAGFRSNFLKQGSFMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 65- 1 .pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQI XKKDSEKGGVLQFGTEI VAHADGS 

IIIIIIIIIIIIMllll: : III I :|[ II I 111: : | : 

gl65-l ELRKTKEERFASLLEYYPR-QTRRLVLITQXTR-HIIXYDS-KLRVLQLYEIVPRDARSR 

370 380 390 400 410 



430 440 450 460 470 480 
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ml 65-1 .pep LAALLGASPGASTAVPI^IRLMHOCFPERAPSWEDRLKELVPGYGIKLNENPERADEIIA 
: I I I 

gl65-l ILERRGASRXALISADDTAPSAPVLESVX 
420 430 440 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 695>: 

aX65-1.8aq 

1 ATGGCTGAAG C6ACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGTTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGT GGCTTTCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

-901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCACTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTTC GCCTCCCTGC TGGAATACTA CCCCGAGGCA AACCCCGACG 

1151 ACTGGGAACT CATCACCGCA GGGCAACGCG TTCAAATCAT TAAAAAAGAC 

1201 TCCGAAAAAG GCGGCGTGTT GCAGTTTGGT ACGGAGATTG TCGCACACGC 

1251 CGACGGCTCG CTCGCCGCAT TGCTGGGCGC GTCGCCGGGC GCATCGACCG 

1301 CCGTGCCGCT GATGATCCGG CTGATGCACC AATGCTTCCC CGAACGCACC 

1351 CCGTCTTGGG AAGGCCGTCT GAAAGAGCTG GTACCGGGTT ACGGCATCAA 

1401 GTTGAACGAA AACCCCGAAA GGGCGGATGA AATTATCGCC TATACCGCGA 

1451 AAGTGTTGGA TATTTAA 

This corresponds to the amino acid sequence <SEQ ID 696; ORF 165-l.a>: 

al65-l .pep 

1 MAEAT DWLV GGGIMSATLG VLL KELEPSW EITLIERLED VALESSNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN lAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQPGjFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

401 SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERT 

451 PSWEGRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI* 

al65-l/ml65-l 99.4% identity in 488 aa overlap 

10 20 30 40 50 60 

al 65-1 . pep MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

IIIMMIIIIIIIIIIIt I Mllllllillllllllllllltl 

ml 6 5 - 1 MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALES SNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

al65-l . pep EliNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
IIMMIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIilllllllllllll 
ml 65-1 ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

al65-l . pep HC S YLQKRY DAFKTQKLFENMEFSTDRNK I SDWAPLMMRGRDENQPVAAN YSAEGTDVDF 
tllMIIIMIIItltllilllllllllllMlltlllllllltlllllltllllllltl 
ml65-l HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 
130 140 150 160 170 180 



190 200 210 220 230 240 
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al65-l .pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
lllllllllllllltlMMIIIIIIIIIIIIIIIIIIIItlllllttllllllllllll 
na65-l GRLTRQMVKYLCX3KGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
190 200 210 220 230 240 

250 260 270 280 290 300 

al 65-1 . pep GGGALTLLQKSGI PEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
llNIHIIIIIIIIIIIIIIIIItllllllllllMIIIIIIIIIIMIItllllllll 
ml 65-1 GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 

310 320 330 340 350 360 

al65-l . pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 
llllllllllillllllllllllltllllllllllinillllll MM Mill II I II 
ml 65-1 DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTKYLLG 

310 320 330 340 350 360 

370 380 390 400 410 420 

a 165-1. pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 
MIMIIMMIMIMMIMMMIMMMMMMMIMMMMMIMMMI 
ml 65-1 ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 

370 380 390 400 410 420 

430 440 450 460 470 480 

a 165-1. pep LAALLGASPGASTAVPLMIRLMHQCFPERTPSWEGRLKELVPGYGIKLNENPERADEIIA 
IIMMIIMIMMMMMIMMMMIMI II It II II M M II II M M M II I 
ml 65-1 LAALLGASPGASTAVPLMIRLMHQCFPERAPSWEDRLKELVPGYGIKLNENPERADEIIA 

430 440 450 460 470 480 



al65-l.pep 
ml65-l 



489 

YTAKVLDIX 
I I II M II I 
YTAKVLDIX 



al65-l/p33940 

sp|P33940|YOJH_ECOLI HYPOTHETICAL 60.2 KD PROTEIN IN ECO-ALKB INTERGENIC REGION 
>gi|1736851|gnl|PID|dl016718 (D90850) ORF_ID:o372«5; similar to [SwissProt Accession Number 
P33940} tEscherichia coli] >gi 1 1788539 (AE000310) f548; This 54B aa ORF is 100 pet identical 
to 490 residues of YOJH_ECOLI SW: P33940 (492 aa) but contains 56 additional N-ter aa; 100 pet 
identical to GB: ECOHU49_33 

ACCESSION: U00008 (490 aa) but contains 58 aditional N-term resi... Length = 548 
Score = 458 bits (1167), Expect » e-128 

Identities = 233/490 (47%), Positives = 303/490 (61%), Gaps = 5/490 (1%) 

Query: 3 EATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALCEL 62 

+ TDV+L+GGGIMSATLG L+ELEP W +T++ERLE VA ESSN WNNAGTGHSAL EL 
Sbjct: 30 QETDVLLIGGGIMSATLGTYLRELEPEWSMTMVERLEGVAQESSNGWNNAGTGHSALMEL 89 

Query: 63 NYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLED-NSFINAVPHMSLVMNEDH 121 

NY P A+G I +A+ I E F +SRQFWA V G L SFIN VPHMS V ED+ 
Sbjct: 90 NYTPQNADGSISIEKAVAINEAFQISRQFWAHQVERGVLRTPRSFINTVPHMSFVWGEDN 149 

Query: 122 CSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDFG 181 

++L+ RY A + LF M +S D +1 +WAPL+M GRD Q V7^ + GTDV++G 
Sbjct: 150 VNFLRARYAALQQSSLFRGMRYSEDHAQIKEWAPLVMEGRDPQQKVAATRTEIGTDVNYG 209 

Query: 182 RLTRQMVKYLQGKG-VKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTXXXXXXXXXX 240 

+TRQ++ LQ K + + V +KR D W + AD +N Q 

Sbjct: 210 EITRQLIASLQKKSNFSLQLSSEVRALKRNDDNTWTVTVADLKNGTAQ-NIRAKFVFIGA 268 

Query: 241 XXXXXXXXQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 300 

Q+SGIPE K Y GFPV G F + NP+ H AKVYG+ASVGAPPMSVPH+ 
Sbjct: 269 GGAALKLLQESGIPEAKDYAGFPVGGQFLVSENPDVVNHHLAKVYGKASVGAPPMSVPHI 328 

Query: 301 DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 360 

DTR +DGKR ++FGP+A F + FLK GSL DL S N+ PM+ G N L KYL+ 
Sbjct: 329 DTRVLDGKRWLFGPFATFSTKFLKNGSLWDLMSSTTTSNVMP^3MHVGLDNFDLVKYLVS 388 

Query: 361 ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVXXXXXX 420 

++ ++E+RF +L EYYP+A +DW L AGQRVQIIK+D+EKGGVL+ GTE+V 
Sbjct: 389 QVMLSEEDRFEALKEYYPQAKKEDWRLWQAGQRVQIIKRDAEKGGVLRLGTEWSDQQGT 448 



Query: 421 XXXXXXXXXXXXXXVPLMIRLMHQCFPER— TPSWEGRLKELVPGYGIKLNENPERADEI 478 
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476 



P+M+ L+ + F +R +P W-f LK +VP YG KLN + + 
Sbjct: 449 lAALLGASPGASTAAPIMLNLLEKVFGDRVSSPQWQATLKAIVPSYGRKLNGDVAATERE 508 

Query: 479 lAYTAKVLDI 488 

+ YT++VL + 
Sbjct: 509 LQYTSEVLGL 518 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 697>: 
g204 . seq 

1 atggcggcgg cggaaataaa acgccccctc gctgtcgatt tccagcacat 

51 agcgtccgtt ctgcacggcg gcatagccgc ttttgcctgc ctgatagggt 

101 tgcagggcgg aatgcgaaat caggtaatca gtcagtttgc cgccgtcttc 

151 ggcgatattg cccaccagtt tggcaaacaa ggtatggcac acgccgtttt 

201 ccgcccagcc cgaaggcgcg tcctttccgt cggtttccat acatttgccg 

251 acgacggctt ccaagtcgtt gggatgcttt ccggtcagcc ggacggcgtt 

301 ttgttccggc aagcctttaa tcggataact gatttgtttt ttgccgtcgt 

351 tggttttgcc ttcgctactt tgtcccaaag ccaaaccggc aatcgccgta 

401 ttgtcgatgt atttgacttt gaaaaccggt ttcggcgcgc tttgtgccgc 

451 attttgcggc tgttccgccg tattttcgga tttgccgcag gcggcaagca 

501 gcaggcagcc gcccaacacg gcaaaaggta ttttcagcat tccgcactcc 

551 tgatggtttc aaaatgccgt ctgaaatgcc gtctgaaacg tggcaggcgg 

601 aggttcggac ggcattgggt ttatttcaac gggcggatgc cgaccgcatc 

651 gcgtacttta tccaacaatt cgcgcgcttc tttgcgcgct ttttgcgcgc 

701 ctgcctgcaa aatctcttcg atttgcgaag gattagaggt caatgcgttg 

751 tag 

This corresponds to the amino acid sequence <SEQ ID 698; ORF 204.ng>: 

g204 .pep 

1 MAAAEIKRPL AVDFQ HIASV LHGGIAAFAC LIG LQGGMRN QVISQFAAVP 

51 GDIAHQPGKQ GMAHAVFRPA RRRVLSVGFH TFADDGFQW GMLSGQPDGV 

101 LFRQAFNR IT DLFFAWGFA FATL SQSQTG NRRIVDVFDF ENRFRRALCR 

151 ILRLFRRIFG FAAGGKQQAA AQHGKRYFQH SALLMVSKCR LKCRLKRGRR 

201 RFGRHWVYFN GRMPTASRTL SNNSRASLRA FCAPACKISS ICEGLEVNAL 

251 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 699>: 

m204 . seq 

1 ATGGCGGCGG CGGAAATAAA ACGCCCCTTC GCTGTCGATT TCCAGCACAT 

51 AGCGTCCGTT CTGCACGGCG GCATAGCCGC TTTTGCCTGC CTGATAGGGT 

101 TGCAGGGCGG CATGCGAAAC TAGGTAATCC GTCAGTTTGC CGCCGTCTTC 

151 GGCGATATTG CCCACCAGTT TGGCAAACAA GGTATGGCAC ACGCCGTTTT 

201 CTGCCCAACC TGCCGGACTG TCCTTATCAT CGGTTTCCAT ACATTTGCCG 

251 CTGACGGCTT CCAAGTCGCC GGGATGCTTG CCGATCAGTC GGATAACATT 

301 TTGTTCCGGC AAGCCTTTAA TCGGATAACT GATTTGTTTT TTGCCGTCGT 

351 TGGTTTTGCC TTCGCTGCTT TGTCCCAAAT CCAAACCGGC AATCGCCGTA 

401 TTGTCGATAT ATATGACTTT GAAAACCGGT TTCGGCGCGC TTTGTACCGC 

4 51 GTTTTGCGGC TGTACCGCCG TATTTwCGGA TTTGCCGCaC GGCaArGCAG 

501 CAGGCAGCCG CCCAATACGG CAAAArAwGT wTTCAGCATT CCACAyTCCT 

551 GATGGTTTCA AAATGCCGTC TGAAACGCGG CAGGCGGAGG TTCGGACGGC 

601 ATCGGGTTCA TTTCAACGGG CGGATGcCGA CCGCATCgGT ACTTTGTCCA 

651 ATAATTCGCG TGCTTCTTTA CGCGCTTTCG CCGCGCCTGC CTGCAAAATC 

701 TCTTCGATTT GCGAAGGGTC GGCGGTCAGC TCGTTGTAG 

This corresponds to the amino acid sequence <SEQ ID 700; ORF 204>: 
m204 .pep 

1 MAAAEIKRPF AVDF OHIASV LHGGIAAFAC LIG LQGGMRN *VIRQFAAVF 

51 GDIAHQFGKQ GMAHAVFCPT CRTVLIIGFH TFAADGFQVA GMLADQSDNI 

101 LFRQAFN RIT DLFFAWGFA FAAL SQIQTG NRRIVDIYDF ENRFRRALYR 

151 VLRLYRRIXG FAATAXQQAA AQYGKXXXQH STXLMVSKCR LKRGRRRFGR 

201 HRVHFNGRMP TASGTLSNNS RASLRAFAAP ACKISSICEG SAVSSL* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from A^. gonorrhoeae 

ORF 204 shows 82.0% identity over a 250 aa overlap with a predicted ORF (ORF 204.ng) 
from N. gonorrhoeae: 

m204/g204 

10 20 30 40 50 60 

m204 . pep MAAAEIKRPFAVDFQHIASVLHGGIAAFACLIGLQGGMRNXVIRQFT^VFGDIAHQFGKQ 

llllllllhllllllllllllllMIIIIIIIIIIIIII II llllllllllllllll 

g204 MAAAEIKRPLAVDFQHIASVLHGGIAAFACLIGLOGGMRNQVXSQFAAVFGDIAHQFGKQ 
10 20 30 40 50 60 

70 80 90 100 110 120 

m204 .pep GMAHAVFCPTCRTVLIIGFHTFAADGFQVAGMLADQSDNILFRQAFNRITDLFFAWGFA 
Illllll h I II :|||||l llllhlll: I |::|||||lillllll||||||l 
g2 04 GMAHAVFRPARRRVLS VGFHTFADDGFQWGMLSGQPDGVLFRQTIFNRITDLFFAWGFA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m204 . pep FAALSQIQTGNRRIVDIYDFENRFRRALYRVLRLYRRIXGFAATAXQQAAAQYGKXXXQH 
Ihlll llllllllh:|IMIIIIII hllhlll nil : IIIMhIl II 
g2 04 FATLSQSQTGNRRIVDVFDFENRFRRALCRILRLFRRI FGFAAGGKQQAAAQHGKRYFQH 

130 140 150 160 170 180 

190 200 210 220 230 

m204 . pep STXLMVSKCRLK RGRRRFGRHRVHFNGRMPTASGTLSNNSRASLRAFAAPACKISS 

h lllllllll lllllllll hlllllllll lllllllllllll llllllll 
g2 04 SALLMVSKCRLKCRLKRGRRRFGRHWVYFNGRMPTASRTLSNNSRASLRAFCAPACKISS 

190 200 210 220 230 240 • 



m204 .pep 
g204 



240 
ICEGSAVSSLX 

nil h:| 
ICEGLEVNAL 
250 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 701>: 



a204 .seq 












1 


ATGGCGGCGG 


CGGAAATAAA 


ACGCCCCCTC 


GCTGTCGATT 


TCCAGCACAT 


51 


AGCGTCCGTT 


CTGCACGGCG 


GCATAGCCGC 


TTTTGCCTGC 


CTGATAGGGT 


101 


TGCAGGGCGG 


AATGCGAAAT 


CAGGTAATCC 


GTCAGTTTGC 


CGCCGTCTTC 


151 


GGCGATATTG 


CCCACCAGTT 


TGGCAAACAA 


GGTATGGCAC 


ACGCCGTTTG 


201 


CCGCCCAGCC 


CGAAGGCGCG 


CCCTTTCCGT 


CGGTTTCCAT 


ACATTTGCCG 


251 


ACGACGGCTT 


CCAAGTCGTT 


GGGATGCTTG 


CCGGTCAGCC 


GGACGACGTT 


301 


TTGTTCCGGC 


AAGCCTTT 








351 










401 












451 












501 












551 










GTTCGGACGG 
GTACTTTGCC 


601 


CATTGGGTTT 


ATTTCAACGG 


GCGGATACCG 


ACCGCATCAC 


651 


CAATAATTCG 


CGTGCTTCTT 


TACGCGCTTT 


TTGCGCGCCT 


GCCTGCAAAA 


701 


TCTCTTCGAT 


TTGCGAAGGG 


TCGGCGGTCA 


GCTCGTTGTA 


G 



This corresponds to the amino acid sequence <SEQ ID 702; ORF 204,a>: 

a204.pep 

1 MAAAEIKRPL AVDFQ HIASV LHGGIAAFAC LIG LOGGMRN QVIRQFAAVF 

51 GDIAHQFGKQ GMAHAVCRPA RRRALSVGFH TFADDGFQW GMLAGQPDDV 

101 LFRQAF 

151 KRFGR 

201 HWVYFNGRIP TASRTLPNNS RASLRAFCAP ACKISSICEG SAVSSL* 
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Iii204/a204 54.5% identity in 246 aa overlap 

10 20 30 40 50 60 

in2Q4 .pep MAAAEIKRPFAVDFQHIASVLHGGIAAFACLIGLQGGMRNXVIRQFAAVFGDIAHQFGKQ 

IMIIIIIIMIIIIIIIIIIIIIIIIMIII Illllllllllllllll 

a204 MAAAEIKRPLAVDFQHIASVLHGGIAAFACLIGLQGGMRNQVIRQFAAVFGDIAHQFGKQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

m204 . pep GMAHAVFCPTCRTVLIIGFHTFAADGFQVAGMLADQSDNILFRQAFNRITDLFFAWGFA 

tlllll I: I :l rllllll IIIIMIIII I |::ltllll 
a2 0 4 GMAHAVCRPARRRALSVGFHTFADDGFQVVGMLAGQPDDVLFRQAF 

70 80 90 100 



130 140 150 160 170 180 

m204 . pep FAALSQIQTGNRRIVDI YDFENRFRRALYRVLRLYRRIXGFAATAXQQAAAQYGKXXXQH 

a204 



190 200 210 220 230 240 

m204 .pep STXLMVSKCRLKRGRRRFGRHRVHFNGRMPTASGTLSNNSRASLRAFAAPACKISSICEG 

tlllll l:ll)l:lltl II llllllllll llllllllllll 

a204 KRFGRHWVYFNGRIPTASRTLPNNSRASLRAFCAPACKISSICEG 

110 120 130 140 150 



m204.pep SAVSSLX 
I I M I II 

a204 SAVSSLX 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 703>: 

g205.seq 

1 atgctgaaaa taccttttgc cgtgttgggc ggctgcctgc tgcttgccgc 

51 ctgcggcaaa tccgaaaata cggcggaaca gccgcaaaat gcggcacaaa 

101 gcgcgccgaa accggttttc aaagtcaaat acatcgacaa tacggcgatt 

151 gccggtttgg ctttgggaca aagtagcgaa ggcaaaacca acgacggcaa 

201 aaaacaaatc agttatccga ttaaaggctt gccggaacaa aacgccgtcc 

251 ggctgaccgg aaagcatccc aacgacttgg aagccgtcgt cggcaaatgt 

301 atggaaaccg acggaaagga cgcgccttcg ggctgggcgg aaaacggcgt 

351 gtgccatacc ttgtttgcca aactggtggg caatatcgcc gaagacggcg 

401 gcaaactgac tgattacctg atttcgcatt ccgccctgca accctatcag 

451 gcaggcaaaa gcggctatgc cgccgtgcag aacggacgct atgtgctgga 

501 aatcgacagc gagggggcgt tttatttccg ccgccgccat tattga 

This corresponds to the amino acid sequence <SEQ ID 704; ORF 205.ng>: 
g20S .pep 

1 MLKIPFAVLG GCLLIAA CGK SENTAEQPQN AAQSAPKPVF KVKYIDNTAI 

51 AGLALGQSSE GKTNDGKKQI SYPIKGLPEQ NAVRLTGKHP NDLEAWGKC 

101 METDGKDAPS GWAENGVCHT LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ 

151 AGKSGYAAVQ NGRYVLEIDS EGAFYFRRRH Y 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 705>: 

m2a5. seq 

1 ATGCTGAAwA CwTyTTTTGC CGTATTGGGC GGCTGCCTGC TGCyTtGCCG 

51 tGCGGCAAAT CCGwAAATAC GGCGGTACAG CCGCAAAACG CGGTACAAAG 

101 CGCGCCGAAA CCGGTTTTCA AAGTCATATA TATCGACAAT ACGGCGATTG 

151 CCGGTTTGGA TTTGGGACAA AGCAGCGAAG GCAAAACCAA CGACGGCAAA 

201 AAACAAATCA GTTATCCGAT TAAAGGCTTG CCGGAACAAA ATGTTATCCG 

251 ACTGATCGGC AAGCATCCCG GCGACTTGGA AGCCGTCAGC GGCAAATGTA 

301 TGGAAACCGA TGATAAGGAC AGTCCGGCAG GTTGGGCAGA AAACGGCGTG 

351 TGCCATACCT TGTTTGCCAA ACTGGTGGGC AATATCGCCG AAGACGGCGG 

401 CAAACTGACG GATTACCTAG TTTCGCATGC CGCCCTGCAA CCCTATCAGG 

451 CAGGCAAAAG CGGCTATGCC GCCGTGCAGA ACGGACGCTA TGTGCTGGAA 

501 ATCGACAGCG AAGGGGCGTT TTATTTCCGC CGCCGCCATT ATTGA 
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This corresponds to the amino acid sequence <SEQ ID 706; ORF 205>: 

m205.pep 

1 MLXTXFAVLG GCLLX CRCGK SXNTAVQPQN AVQSAPKPVF KVIYIDNTAI 

51 AGLDLGQSSE GKTNDGKKQI SYPIKGLPEQ NVIRLIGKHP GDLEAVSGKC 

101 METDDKDSPA GWAENGVCHT LFAKLVGNIA EDGGKLTDYL VSHAALQPYQ 

151 AGKSGYAAVQ NGRYVLEIDS EGAFYFRRRH Y* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 205 shows 88.4% identity over a 181 aa overlap with a predicted ORF (ORF 205.ng) 
from N. gonorrhoeae: 

m205/g205 

10 20 30 40 50 60 

MliXTXFAVLGGCLLXCRCGKSXNTAVQPQNAVQSAPKPVFKVIYIDNTAIAGLDIiGQSSE 
II lllllllll llll III llllhllllllllll llllllllll llllll 
MLKIPFAVLGGCLLliAACGKSENTAEQPQNAAQSAPKPVFKVKYIDNTAIAGLALGQSSE 
10 20 30 40 50 60 

70 80 90 100 110 120 

GKTNDGKKOISYPIKGLPEQNVIRLIGKHPGDLEAVSGKCMETDDKDSPAGWAENGVCHT 

llllllllllllllllllllhHI lllhlllll lllllll IhhIIIMIIIII 

GKTNDGKKQISYPIKGLPEQNAVRLTGKHPNDLEAWGKCMETDGKDAPSGWAENGVCHT 
70 80 90 100 110 120 

130 140 150 160 170 180 

m205 .pep LFAKLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQNGRYVLE IDS EGAFYFRRRH 

liillMMIillllMllhlhlllMllllllllllllllllllllllllllllill 

g205 LFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQNGRYVLEIDSEGAFYFRRRH 
130 140 150 160 170 180 



m205 .pep 
g205 



Tn205 .pep 
g205 



m205.pep YX 
I 

g205 Y 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 707>: 

a205.seq (partial) 

1 TCCGAACCTC TTAAAGGCTT GCCGGAACAA AACGTCGTCC GGCTGACCGG 

51 CAAGCATCCC AACGACTTGG AAGCCGTCGT CGGCAAATGT ATGGAAACCG 

101 ACGGAAAGGG CGCGCCTTCG GGCTGGGCGG CAAACGGCGT GTGCCATACC 

151 TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG GCAAACTGAC 

201 GGATTACCTG ATTTCGCATT CCGCCCTGCA ACCCTATCAG GCAGGCAAAA 

251 GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA AATCGACAGC 

301 GAGGGGGCGT TTTATTTCCG CCGCCGCCAT TATTGA 

This corresponds to the amino acid sequence <SEQ ID 708; ORF 205.a>: 

a205.pep (partial) 

1 SEPLKGLPEQ NWRLTGKHP NDLEAWGKC METDGKGAPS GWAANGVCHT 

51 LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ AGKSGYAAVQ NGRYVLEIDS 

101 EGAFYFRRRH Y* 

m205/a205 88.3% identity in 1 1 1 aa overlap 

50 60 70 80 90 100 

m205.pep KVIYIDNTAIAGLDLGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKC 

I I : I I I I I I I I : I I I i I i : I I I I I III 
a205 SEPLKGLPEQNWRLTGKHPNDLEAWGKC 

10 20 30 



110 120 130 140 150 160 

m205 . pep METDDKDSPAGWAENGVCHTLFAKLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQ 
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Mill IIIIIIMMItlllll)lttlllll:|l:||IMIIIIIIIIIII 
a205 METDGKGAPSGWAANGVCHTLFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQ 
40 50 60 70 80 90 

170 180 
m205 . pep NGRYVLEIDSEGAFYFRRRHYX 
I I I I I I I I I I I I I I I I I I I I I I 
a 2 0 5 NGR YVLE I DSEGAFYFRRRHYX 

100 110 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 709>: 

g205-1.8eq (partial) 

1 ATGCTGAAAA TAcCTTTTGC CGTGTTGGGC 

51 CTGCGGCAAA TCCGAAAATA CGGCGGAACA 

101 GCGCGCCGAA ACCGGTTTTC AAAGTCAAAT 

151 GCCGGTTTGG CTTTGGGACA AAGTAGCGAA 

201 AAAACAAATC AGTTATCCGA TTAAAGGCTT 

251 GGCTGACCGG AAAGCATCCC AACGACTTGG 

301 ATGGAAACCG ACGGAAAGGA CGCGCCTTCG 

351 GTGCCATACC TTGTTTGCCA AACTGGTGGG 

401 GCAAACTGAC TGATTACCTG ATTTCGCATT 

451 GCAGGCAAAA GCGGCTATGC CGCCGTGCAG 

501 AATCGACAGC GAGGGGGCGT TTTA 

This corresponds to the amino acid sequence <SEQ ID 710; ORF 205-1 .ng>: 

g205-l.pep (partial) . 

1 MLKIPFAVLG GCLLLAA CGK SENTAEQPQN AAQSAPKPVF KVKYIDNTAI 
51 AGLALGQSSE GKTNDGKKQI SYPIKGLPEQ NAVRLTGKHP NDLEAWGKC 
101 METDGKDAPS GWAENGVCHT LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ ' 
151 AGKSGYAAVQ NGRYVLEIDS EGAF 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 71 1>: 

m20S-l.seq. . 

1 ATGCTGAAAA CATCTTTTGC CGTATTGGGC GGCTGCCTGC TGCTTGCCGC 

51 CTGCGGCAAA TCCGAAAATA CGGCGCSAACA GCCGCAAAAC GCGGTACAAA 

101 GCGCGCCGAA ACCGGTTTTC AAAGTCAAAT ATATCGACAA TACGGCGATT 

151 GCCGGTTTGG ATTTGGGACA AAGCAGCGAA GGCAAAACCA ACGACC3GCAA 

201 AAAACAAATC AGTTATCCGA TTAAAGGCTT GCCGGAACAA AATGTTATCC 

251 GACTGATCGG CAAGCATCCC GGCGACTTTCG AAGCCGTCAG CGGCAAATGT 

301 ATGGAAACCG ATGATAAGGA CAGTCCGGCA GGTTGGGCAG AAAAC(K3CGT 

351 GTGCCATACC TTGTTTC3CCA AACTGGTGGG CAATATCGCC GAAGACGGCG 

401 GCAAACTGAC GGATTACCTA GTTTC6CATG CCGCCCTGCA ACCCTATCAG 

451 GCAGGCAAAA GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA 

501 AATCGACAGC <3AAGC3G(XX3T TTTATTTCCG CCGCCGCCAT TATTGA 

This corresponds to the amino acid sequence <SEQ ID 712; ORF 205-l>: 

ia205-l,pep 

1 MLKTSFAVLG GCLLLAA CGK SENTAEQPQN AVQSAPKPVF KVKYIDNTAI 
51 AGLDLGQSSE GKTNDGKKQI SYPIKGLPEQ NVIRLIGKHP GDLEAVSGKC 
101 METDDKDSPA GWAENGVCHT LFAKLVGNIA EDGGKLTDYL VSHAALQPYQ 
151 AGKSGYAAVQ NGRYVLEIDS EGAFYFRRRH Y* 

m205-l/g205-l 92.0% identity in 174 aa overlap 

10 20 30 40 50 60 

MLKI PFAVLGGCLLLAACGKS ENTAEQPQNAAQSAPKP VFKVKYIDNTAI AGLALGQSSE 

III I II I III Mill II III III III II hill II I II I II I mil III Mill 1 1 

MIiKTS FAVLGGCLLLAACGKS ENTAEQPQNAVQSAPKP VFKVKYIDNTAI AGLDLGQSSE 
10 20 30 40 50 60 

70 80 90 100 110 120 

GKTNDGKKQI S YPI KGLPEQNAVRLTGKHPNDLEAWGKCMETDGKDAPSGWAENGVCHT 
llllllllllllllllllllh:|| lllhlllll lllllll IhhIIIIIIIIII 
GKTNDGKKQI S YPI KGLPEQNVIRLIGKHPGDLEAVSGKCMETDDKDS PAGWAENGVCHT 



GGCTGCCTGC 
GCCGCAAAAT 
ACATCGACAA 
GGCAAAACCA 
GCCGGAACAA 
AAGCCGTCGT 
GGCTGGGCGG 
CAATATCGCC 
CCGCCCTGCA 
AACGGACGCT 



TGCTTGCCGC 
GCGGCACAAA 
TACGGCGATT 
ACGACGGCAA 
AACGCCGTCC 
CGGCAAATGT 
AAAACGGCGT 
GAAGACGGCG 
ACCCTATCAG 
ATGTGCTGGA 



g205-l .pep 
m205-l 

g205-l.pep 
m20S-l 
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70 80 90 100 110 120 

130 140 150 160 170 

g2 0 5 - 1 . pep LFAKLVGNI AEDGGKLTDYLI SHSALQPYQAGKSGYAAVQNGRYVLE I DSEGAP 

lllllllllllllllllillOhllllllllllllllMIIIIIIIIIIIIII 

Tn2 0 5 - 1 liFAKLVGNI AEDGGKLTDYLVSHAALQPYQAGKSGYAAVQNGRYVLE IDSEGAFYFRRRH 

130 140 150 160 170 180 

m205-l YX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 71 3>: 

a20S-l.seq (partial) 

1 CCTCTTAAAG GCTTGCCGGA ACAAAACGTC GTCCGGCTGA CCGGCAAGCA 

51 TCCCAACGAC TTGGAAGCCX3 TCGTCGGCAA ATGTATGGAA ACCGACGGAA 

101 AGGGCGCGCC TTCGGGCTGG GCGGCAAACG <3CGTGTGCCA TACCTTGTTT 

151 GCCAAACTGG TGGGCAATAT CGCCGAAGAC GGCGGCAAAC TGACGGATTA 

201 CCTGATTTCG CATTCCGCCC TGCAACCCTA TCAGGCAGGC AAAAGCGGCT 

251 ATGCCGCCGT GCAGAACGGA CGCTATGTGC TGGAAATCGA CAGCGAGGGG 

301 GCGTTTTATT TCCGCCGCCG CCATTATTGA 

This corresponds to the amino acid sequence <SEQ ID 714; ORF 205-1. a>: 

a305-l.pep (partial) 

1 PLKGLPEQNV VRLTGKHPND LEAWGKCME TDGKGAPSGW AANGVCHTLF 
51 AKLVGNIAED GGKLTDYLIS HSALQPYQAG KSGYAAVQNG RYVLEIDSEG 
101 AFYFRRRHY* 

ma05-l/a205-l 89.0% identity in 109 aa overlap 

50 60 70 80 90 100 

m205-l,pep KYIDNTAIAGLDIJGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKC^fE 

hlllllllhll lllhlllll Mill 

a2 0 5 - 1 PLKGLPEQNWRLTGKHPNDIiEAWGKCME 

10 20 30 

110 120 130 140 150 160 

tti2 0 5 - 1 . pep TDDKDS PAGWAENGVCHTLFAKLVGNI AEDGGKLTDYLVSHAALQPYQAGKSG YAAVQNG 
II I H:|ll lllllllllllllllllllllllllhlhllllllllllllllllll 
a205 - 1 TDGKGAPSGWAANGVCHTLFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQNG 
40 50 60 70 80 90 

170 180 
m205-l.pep RYVLEIDSEGAFYFRRRHYX 

llllllllllllllllllll 
a205-l RYVLEIDSEGAFYFRRRHYX 
100 110 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 715>: 

g206 .seq 

1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggactca tcggcacgcc 

201 ctacaaatgg ggcggcagca gcaccgcaac cggcttcgac tgcagcggca 

251 tgattcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

301 gcccgcgaca tggcggcggc aagccgcaaa atccccgaca gccgcctcaa 

351 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctactcac 

401 acgtcggact ctacatcggc aacggcgaat tcatccatgc ccccggcagc 

451 ggcaaaacca tcaaaaccga aaaactctcc acaccgtttt acgccaaaaa 

501 ctaccttgga gcgcatacgt tttttacaga atga 

This corresponds to the amino acid sequence <SEQ ID 716; ORF 206 .ng>: 

g206 .pep 

1 MFSPDKTLFL CLGALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIGRT 
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51 QGSQELMLHS U3LIGTPYKW GGSSTATGFD CSGMIQLVYK N/^LNVKLPRT 
101 ARDMAAASRK IPDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS 
151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 71 7>: 

m206 . seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCX3TCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAr AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGsAAA ATCCCCGAcA GCCGCyTCAA 

351 6GCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

401 ACGTCGGACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TTTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ E) 718; ORF 206>: 

m206 .pep. . 

1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 
51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 
101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 
151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF fi-om K gonorrhoeae 

ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from K gonorrhoeae: 

m206/g206 

10 20 30 40 50 60 

MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 

II llllllllhlllMIIIIIIIIIIIIIIIIIIIIIIIIIIIII llllllllllll 

MFSPDKTLFLCLGALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIGRTQGSQELMLHS 
10 20 30 40 50 60 

70 80 90 100 110 120 

LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 

IIIIIMIIINIIIIIIIIIIIIIhllllllllllllllllllllllllllll Mil 

LGLIGTPYKWGGSSTATGFDCSGMIQLVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 
70 80 90 100 110 120 

130 140 150 160 170 

LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

I I I I I I I I I I I M I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
IVFFNTGGAHRYSHVGLYIGNGEFIHAPGSGKTIKTEKLSTPFYAKNYLGAHTFFTE 
130 140 150 160 170 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 719>: 

a206.3eq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

401 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 



in206 .pep 
g206 

m206 .pep 
g206 

m206 .pep 
g206 
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This corresponds to the amino acid sequence <SEQ ID 720; ORF 206.a>: 

a206.pep 

1 MFPPD KTLFL CLSALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 99.4% identity in 1 77 aa overlap 

10 20 30 40 50 60 

m2 06 . pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQ^VRISHIDRTQGSQELMLHS 
I i I t I I I I I t I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I t I I I I I I I I I I I 
a206 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m206.pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 

1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 i 1 1 nil 

a206 LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 
70 80 90 100 110 120 

130 140 150 160 170 

m206 . pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
I I I I I I I I t I I I I I I I I t M II M II I I I I I I I I I I I II I II II I II II II I II II II 
a206 LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGMTFFTEX 

130 140 150 160 170 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 721>: 

g209.seq 

1 atgctgcggc atttaggaaa cgacttcgcc ttgggcgcgt tgtttttcga 

51 tgctgcggtt gatgtgccac tgctgggcga tggtcaggag gttgttgacc 

101 acccagtaga gaaccaaacc ggcagggaag aagaagaaca tgacggagaa 

151 aaccaacggc atgattttca tcattttcgc ctgcatcggg tcggtcggcg 

201 gcgggttcag ataggtttgg gcgaacatcg ttgccgccat aatgatgggc 

251 aggatgtagt aggggtcggc gcggctgagg tcggtaatcc agcccagcca 

301 aggtgcctgg cgcaattcta cggaggcgaa caatgcccag tacaagccga 

351 tgaagacggg gatttgcaac agcataggca gacagccgcc cagcgggttg 

401 atttcctcgt cttcgaaaag ctgcatcatc gcttgctgtt gcgccatacg 

451 gtcgtcgccg tatttttctt tgatggtctg cagttcgggt gcggcggcac 

501 gcattttcgc catcgaacgg taggaggcgt tggtcaatgg atacagtacg 

551 gctttgacga tgatggtcaa aacgacgatt gcccagcccc agttgccgat 

601 aatgttgtgc agttggttca ggagccagaa gagcggcgat gcgaaccagt 

651 gtactttacc gtagtctttt gccagttgca ggttgtcggc gatgtttgcg 

701 ataacggatg tggtttgcgg accggcatac aggttgaccg ccattttcgg 

751 ttttggcccc cgggttggga tagcggttaa 

This corresponds to the amino acid sequence <SEQ ID 722; ORF 209.ng>: 

g209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE WDHPVENQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDWGVG AAEVGNPAQP 

101 RCLAQFYGGE QCPVQADEDG DLQQHRQT7VA QRVDFLVFEK LHHRLLLRHT 

151 VVAVFFFDGL QFGCGGTHFR HRTVGGVGQW IQYGFDDDGQ NDDCPAPVAD 

201 NWQLVQEPE ERRCEPVYFT WFCQLQWG DVCDNGCGLR TGIQVDRHFR 

251 FWPPGWDSG* 

The following partial DNA sequence was identified in A^. meningitidis <SEQ ID 723>: 

m209.seq 

1 ATGCTGCGGC ATTTAGGAAA CGACTTCGCC TTGGgGCGTT GTTTTTCGAT 

51 GCTGCGGTTG ATGTGCCATT GCTGGGCGAT GGTCAGGAGG TTGTTGACTA 

101 CCCAGTACAA TACCAGACCG GCAGGGAAGA AGAAGAACAT GACGGAGAAA 

151 ACCAACGGCA TGATTTTCAT CATTTTCGCC TGCATCGGGT CGGTCGGCGG 

201 CGGGTTCAGA TAAGTTTGGG CGAACATCGT TGCCGCCATA ATGATGGGCA 

251 GGATGTAGTA GGGGTCGGCG CGGCTGAGGT CGGTAATCCA ACCCAGCCAA 
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301 GGTGCCTGGC GCAATTCTAC GGAGGCGAAC AATGCCCAAT ACAATCCGAT 

351 GAAGACGGGG ATTTGCAACA GCATAGGCAG GCAGCCGCCC AGCGGGTTGA 

401 TTTTCTCGTC TGTGTAAAGC TGCATCATCG CCTGTTGTTG CGCCATACGG 

451 TCGTCGCCGT ATTTCTCTTT GATGGCTTGC AGTTTGGGTG CGGCGGCACG 

501 CATTTTCGCC ATAGAGCGGT AAGAGGCGTT GGTCAATGGA TACAGTACGG 

551 CTTTGACGAT GATGGTTAAA ACGATAATCG CCCAGCCCCA GTTGCCGATG 

601 ATGTTGTGCA GTTGGTTCAG GAGCCAGAAG AGCGGGGAGG CGAACCAGTG 

651 TACTTTGCCG TAGTCTTTGG CCAGTTGCAG GTTGTCGGCG ATGTTTGCGA 

701 TGACGGATGT GGTCTGCGGG CCGGCGTAGA GGTTGATGGA GGCTTCGgTT 

751 TCGCGCCGTT TTGGATGGCG GCTAAAGGCA CGCTGACGCT GGTGCTGTAC 

801 AGCTTGTCGT TGCGGCGTTT GATGTCGATG TTGCACTCGC CTGCGGCGCA 

851 AACGCTTTGT CTGCCTTTAG GTTGGAGAAT CCAGGTGGAC ATGAAGTGGT 

901 GTTCAATCAT GCCGAGCCAG CCGGTCGGGG TTTTGCGGAT GTATTCGGCC 

951 TCGGATTTGC CGGATTTGGC ATCGTCGTCC AAGTCGGAAA AGCTGACTTT 

1001 TTGGAAGTTG CCTTCAGGGG TATAA 

This corresponds to the amino acid sequence <SEQ ID 724; ORF 209>: 

m209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE WDYPVQYQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ ISLGEHRCRH NDGQDWGVG AAEVGNPTQP 

101 RCLAQFYGGE QCPIQSDEDG DLQQHRQAAA QRVDFLVCVK LHHRLLLRHT 

151 WAVFLFDGL QFGCGGTHFR HRAVRGVGQW IQYGFDDDG* NDNRPAPVAD 

201 DWQLVQEPE ERGGEPVYFA WFGQLQWG DVCDDGCGLR AGVEVDGGFG 

251 FAPFWMAAKG TLTLVLYSLS LRRLMSMLHS PAAQTLCLPL GWRIQVDMKW 

301 CSIMPSQPVG VLRMYSASDL PDLASSSKSE KLTFWKLPSG V* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 209 shows 88.5% identity over a 253 aa overlap with a predicted ORF (ORF 209.ng) 
from A'; gonorrhoeae: 

m209/g209 

10 20 30 40 50 60 

m2 0 9 . pep MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDYPVQYQTGREEEEHDGENQRHDFHHFR 

I I I I I I I I I I i I t I I I I I I I I M I I I I I: 1 I : I I I I I I I I I I I I I I I I I I t I I I 

g2 0 9 MLRHLGNDFALGALFFDAAVDVPLLGDGQE WDHPVENQTGREEEEHDGENQRHDFHHFR 

10 20 30 40 50 60 

70 80 90 100 110 120 

ni209 . pep LHRVGRRRVQISLGEHRCRHNDGQDWGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 
IMHIIIIII:|lllllllilllllllllll|ilt|:||i|||i||||||||:|:|||| 
g2 0 9 LHRVGRRRVQIGLGEHRCRHNDGQDWGVGAAEVGNPAQPRCLAQFYGGEQCPVQADEDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 0 9 . pep DLQQHRQAAAQRVDFLVC VKLHHRLLLRHT WAVFLFDGLQFGCGGTHFRHRAVRGVGQW 

M I I I i I I I I I I ] i lllll)llllllllll:|IIIIIIMIIIIIII:| lllli 
g2 0 9 DLQQHRQTAAQRVDFLVFEKLHHRLLLRHT VVAVFFFDGLQFGCGGTHFRHRTVGGVGQW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m209 . pep IQYGFDDDGXNDNRPAPVADDWQLVQEPEERGGEPVYFAWFGQLQWGDVCDDGCGLR 
Ntllllll I): I I I I I I : I t I I I I I I i I I 11111:111 I I I I I I I I I I : I I I I I 
g209 IQYGFDDDGQNDDCPAPVADNWQLVQEPEERRCEPVYFTWFCQLQWGDVCDNGCGLR 

190 200 210 220 230 240 

250 260 270 280 290 299 

in209.pep AGVEVDGGFGF-APFWMAAKGTLTLVLYSLSLRRLMSMLHSPAAQTLCLPLGWRIQVDMK 

: I :: I I II II 
g2 0 9 TGIQVDRHFRFWPPGWDSG 

250 

The foUovraig partial DNA sequence was identified in N. meningitidis <SEQ ID 725>: 

a209.seq 

1 ATGCTGCGGC ATTTAGGAAA CGACTTCGCC TTGGGCGCGT TGTTTTTCGA 
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51 TGCTGCGGTT GATGTGCCAT TGCTGGGCGA TGGTCAGGAG GTTGTTGATC 

101 ACCCAGTACA ATACCAGACC GGCAGGGAAG AAGAAGAACA TGACGGAGAA 

151 AACCAAAGGC ATGATTTTCA TCATTTTCGC CTGCATCGGG TCGGTCGGCG 

201 GCGGGTTCAG ATAGGTTTGG GCGAACATCG TTGCCGCCAT AATGATGGGC 

251 AGGATGTAGT AGGGGTCGGC GCGGCTGAGG TCGGTAATCC AACCCAGCCA 

301 AGGTGCCTGG CGCAATTCTA CGGAGGCGAA CAATGCCCAA TACAATCCGA 

351 TGAAGACGGG GATTTGCAAC AGCATAGGCA GGCAGCCGCC CAGCGGGTTG 

401 ATTTTCTCGT CTGTGTAAAG CTGCATCATG GCTTGTTGCT GCGCCATACG 

451 GTCGTCGCCG TATTTCTCTT TGATGGCTTG CAGTTTGGGC GCGGCGGCAC 

501 GCATTTTCGC CATCGAACGG TAAGAGGCGT TGGTCAATGG ATACAGTACG 

551 GCTTTGACGA TGATGGTTAA AACGATAATC GCCCAGCCCC AGTTGCCGAT 

601 GATGTTGTGC AGTTGGTTCA TVAAGCCAAAA GAGGGGGGAG GCGAACCAGT 

651 GTACTTTGCC GTAGTCTTTG GCCAGTTGCA GGTTGTCGGC GATGTTTGCG 

701 ATAACGGATG TGGTCTGTGG GCCGGCGTAG AGGTTGATGG AGGCTTCGGT 

751 TTCGCACCGT TTTGGATAGC GGCT7WVGGC ACGCTGACGC TGGTGCTGTA 

801 CAGCTTGTCG TTGCGGCGTT TGATGTCGAT ACGGCAGTCG CCAGCGGCGC 

851 AAACGCTTTG TCCGCCTTTG GGTTGGAGGA TCCAGGTGGA CATGAAGTGG 

901 TGTTCAATCA TGCCGAGCCA GCCGGTCGGG GTTTTGCGGA TGTATTCGGC 

951 CTCGGATTTG CCGGATTTGG CATCGTCGTC CAAGTCGGAG AAGCTGACTT 

1001 TTTGGAAGTT GCCTTCAGGG GTATAA 

This corresponds to the amino acid sequence <SEQ ID 726; ORF 209,a>: 

a209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE WDHPVQYQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDWGVG AAEVGNPTQP 

101 RCLAQFYGGE QCPIQSDEDG DLQQHRQAAA QRVDFLVCVK LHHGLLLRHT 

151 WAVFLFDGL QFGRGGTHFR HRTVRGVGQW IQYGFDDDG* NDNRPAPVAD 

201 DWQLVQKPK EGGGEPVYFA WFGQLQWG DVCDNGCGLW AGVEVDGGFG 

251 FAPFWIAAKG TLTLVLYSLS LRRLMSIRQS PAAQTLCPPL GWRIQVDMKW 

301 CSIMPSQPVG VLRMYSASDL PDLASSSKSE KLTFWKLPSG V* 

m209/a209 95.6% identity in 341 aa overlap 

10 20 30 40 50 60 

m2 0 9 . pep MLRHLGNDFALGALFFDAAVDVPLLGDGQE WDYPVQYQTGREEEEHDGENQRHDFHHFR 

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I i I : I I I M I I M I I I I I I I I M i I M t I I 
a2 0 9 MLRHLGNDFALGALFFDAAVDVPLLGDGQE WDHPVQYQTGREEEEHDGENQRHDFHHFR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m209.pep LHRVGRRRVQISLGEHRCRHNDGQDVVGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 
i I I I I I I I I I t : I I I I i I I I I I I I I I I I M I I I I I I I I I i I I I I I 1 I I I I I I I I M I I I I 
a209 LHRVGRRRVQIGLGEHRCRHNDGQDWGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

102 0 9 . pep DLQQHRQAAAQRVDFLVCVKLHHRLLLRHTVVAVFLFDGLQFGCGGTHFRHRAVRGVGQW 
i i I I I I I I I I I t I I I I I t I i I I I I I I I I I I I I I M I 1 I I I I I I I I I I I I I : I I I I I i I 
a2 0 9 DLQQHRQAAAQRVDFLVCVKLHHGLLLRHTVVAVFLFDGLQFGRGGTHFRHRTVRGVGQW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m209 . pep IQYGFDDDGXNDNRPAPVADDWQLVQEPEERGGEPVYFAWFGQLQWGDVCDDGCGLR 

I I I f I I I I I I ) I I I I I I I I I I I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I I : I I I I 
a209 IQYGFDDDGXNDNRPAPVADDWQLVQKPKEGGGEPVYFAWFGQLQWGDVCDNGCGLW 

190 200 210 220 230 240 

250 260 270 280 290 300 

ra20 9 . pep AGVEVDGGFGFAPFWMAAKGTLTLVLYSLSLRRLMSMLHSPAAQTLCLPLGWRIQVDMKW 
I t I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I M : : I I I I I I I I I I I I I I I I I | | | 
a209 AGVEVDGGFGFAPFWIAAKGTLTLVLYSLSLRRLMSIRQSPAAQTLCPPLGWRIQVDMKW 

250 260 270 280 290 300 



Tn209.pep 



310 320 330 340 

CSIMPSQPVGVLRMYSASDLPDLASSSKSEKLTFWKLPSGVX 
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a209 



I n I I I I I I I I I I I I I I I I I I I I I t I I I i I I I I I I I I I I I I I 
CSIMPSQPVGVLRMYSASDLPDLASSSKSEKLTFWKLPSGVX 
310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 727>: 

g211 . seq 



1 atgttgcgga ttgctgctgc caatcagttg ggcggtcgaa atggtgcggc 

51 ggtgggaaac ggggtcgata agtttgggcg tggtgctgat aatcaggttg 

101 agtttttgga aggaaacctg attgtagtcg gcgcgtccgg gcgtgccgct 

151 gtaacggtag ccgtggcgca attcgagcgt gcgtttgttg tccttcagcg 

201 agaagttacc ttctttggcg aagatgatgt tgtcgccgcc gtttttgtcc 

251 tgttcgcgca ggaacaggtt • tttcatgatg ccggattcgg tgtcaaaggt 

301 ttcgacgaaa taaaccctgc cgttgcgctt gcccaagtta ttgaactcgc 

351 cggcttccac caaagacaat tcctgcttct gcttcaaaat ttcggcatat 

401 tcgcggctgc gcagctctgc ccacggtatc acccaaagct gcatgacggc 

451 aatcaggatg gcaaacggca cggcaaactg catgacgggg cgtatccact 

501 gtttcaacgc caatccgcag gatag 



This corresponds to the amino acid sequence <SEQ ID 728; ORF 21 l.ng>: 

g211.pep 



1 MLRIAAANQL GGRNGAAVGN GVDKFGRGAD NQVEFLEG NL IWGASGRAA 
51 VTVAVAQFER AFWLQRE VT FFGEDDWAA VFVLFA QEQV FHDAGFGVKG 



101 FDEINPAVAL AQ VIELAGFH QRQFLLLLQN FGIFAAAQLC PRYHPKLHDG 
151 NQDGKRHGKL HDGAYPLFQR QSAG* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 729>: 



1 ATGTTGCGGG TTGCTGCTGC CAATCAGTTG GGCGGTCGGA ATGGTACGGC 

51 GGTGGGAAAC GGGGTCGATG AGTTTGGGCG TGGTGCTGAT AATCAGGTTG 

101 AGTTTTTGGA AGGAAACCTG ATTGTAGTCG GCGCGTCCGG GCGTGCCGCT 

151 GTAACGGTAG CCGTGGCGCA ATTCGAGCGT GCGTTTGTTG TCGTTCAGCG 

201 AGAAGTTACC TTCTTTGGCG AAGATGATGT TGTCGCCGCC GTTTTTGTCC 

251 TGTTCGCGCA GGAACAGGTT TTTCATGATG CCGGATTCGG TATCGAAGGT 

3 01 TTCGACAAAA TAAACCCTGC CGTTGCGCTT GCCCAAACTG TTGAACTCGC 

351 CTGCCTCCAC CAAAGACAAT TCCTGCTTCT GCTTCAGGAT TTCAGCGTAT 

401 TCGCGGCTGC GTAGCTCTGC CCACGGTATC ACCCAAAGCT GCATGACGGC 

451 AACCAAAACG GCAAACGGCA CGGCAAACTG CATCACCGGG CGTATCCATT 

501 GTTTCAATGC CAATCCGCAg GATAG 



This corresponds to the amino acid sequence <SEQ ID 730; ORF 21 1>: 

m211.pep 

1 MLRVAAANQL GGRNGTAVGN GVDEFGRGAD NQVEFLEGN L IWGASGRAA 
51 VTVAVA QFER AFVWQRE VT FFGEDDWAA VFVLFA QEQV FHDAGFGIEG 



101 FDKINPAVAL AQ TVELACLH QRQFLLLLQD FSVFAAAXLC PRYHPKLHDG 
151 NQNGKRHGKL HHRAYPLFQC QSAG* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF firom K gonorrhoeae 

ORF 2 1 1 shows 89.1% identity over a 1 74 aa overlap with a predicted ORF (ORF 2 1 1 .ng) 
firom K gonorrhoeae: 



m2ll.seq 



m211/g211 



m211.pep 



10 20 30 40 50 60 

MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLIWGASGRAAVTVAVAQFER 



g2ii 




10 20 30 40 50 60 



m211.pep 



70 80 90 100 110 120 

AFVWQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVELACLH 
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lllhlllllllllllllllllllllllllllllllh:|||:|||||||l|::||l 
g211 AFWLQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGVKGFDEINPAVALAQVIELAGFH 
70 80 90 100 110 120 

130 140 150 160 170 

m2 1 1 . pep QRQFLLLLQDFSVFAAAXLCPRYHPKLHIXSNQNGKRHGKIiHHRAYPIjFQCQSAGX 

llllllllhhHIII IIIIIIIIMIIIhllllllll Mini III! 

g211 QRQFLLLLQNFGIFAAAQLCPRYHPKLHDGNQDGKRHGKIiHDGAYPLFQROSAG 
130 140 150 160 170 . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 731>: 

a211 . seq 

1 ATGTTGCGGG TTGCTGCTGC CAATCAGTTG GGCGGTCGGA ATGGTACGGC 

51 GGTGGGAAAC GGGGTCGATG AGTTTGGGCG TGGTGCTGAT AATCAGGTTG 

101 AGTTTTTGGA AGGAAACCTG ' ATTGTAGTCG GCGCGTCCGG GCGTGCCGCT 

151 GTAACGGTAG CCGTGGCGCA ATTCGAGCGT GCGTTTGTTG TCGTTCAGCG 

201 AGAAGTTACT TTCTTTGGCG AAGATGATGT TGTCGCCGCC GTTTTTGTCC 

251 TGTTCGCGCA GGAACAGGTT TTTCATGATG CCGGATTCGG TATCGAAGGT 

301 TTCGACA7VAA TAAACCCTGC CGTTGCGCTT GCCCAAACTG TTGAACCCGC 

351 CTGCCTCCAC CAAAGACAAT TCCTGCTTCT GCTTCAGGAT TTCAGCGTAT 

401 TCGCGGCTGC GTAGCTCTGC CCACGGTATC ACCCAAAGCT GCATGACGGC 

451 AACCAAAACG GCAAACGGCA CGGCAAACTG CATCACCGGG CGTATCCATT 

501 GTTTCAATGC CAATCCGCAG GAT AG 

This corresponds to the amino acid sequence <SEQ ID 732; ORF 21 La>: 

a211.pep 

1 MLRVAAANQL GGRNGTAVGN GVDEFGRGAD NQVEFLEGN L IWGASGRAA 

51 VTVAVA QFER AFVWQRE VT FFGEDDWAA VFVLF AQEQV FHDAGFGIEG 

101 FDKINPAVAL AQTVEPACLH QRQ FLLLLQD FSVFAAA*LC PRYHPKLHDG 

151 NQNGKRHGKL HHRAYPLFQC QSAG* 

m21 l/a21 1 99,4% identity in 174 aa overlap 

10 20 30 40 50 60 

in211 . pep MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLIWGASGRAAVTVAVAQFER 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

a211 MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLIWGASGRAAVTVAVAQFER 
10 20 30 40 50 60 

70 80 90 100 110 120 

m211 , pep AFVWQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVELACLH 
I i I I t I I I I I I I I I I I i I 1 1 I I I I I I I I I i I I I I I I I I I I I I I I 1 I I M I I I I I I I I I I 
a211 AFWVQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVEPACLH 
70 80 90 100 110 120 

130 140 150 160 170 

m211 . pep QRQFLLLLQDFSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 
I I I M i I I I I M I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I 
a211 QRQFLLLLQDFSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 

130 140 150 160 170 



The following partial DNA sequence was identified in A^. gonorrhoeae <SEQ ID 733>: 



g212 . seq 


(partial) 










1 


atggacaatc 


tcgtatggga 


cggcattccc 


gacatccgca 


cactcgacca 


51 


aaccatccgc 


aaacacgcac 


acccgctcaa 


cctgattgtc 


tgcctccccg 


101 


ataatcagat 


tcccgatttt 


caaaccgcac 


aagatgcttc 


ggactcggaa 


151 


tgccgtctga 


agcaccgttt 


ggatcaggca 


acccagtgcc 


tccagttcga 


201 


cagcatcaac 


ctcatcgaac 


acatcctgcc 


cgatgtccgc 


ttctggctgg 


251 


ttcccccttc 


acgcacccgc 


cgcctgcacg 


aacacttcca 


ccacatttcc 


301 


tggcagaccg 


aagccatccc 


gcaaaccgaa 


agcaagtccg 


acaaaccctg 


351 


gtttgcactt 


ccacaaacat 


ccgaacggaa 


aaaaccggaa 


cacgtcctcg 
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401 tcatcggtgc aggcattgcc ggcgcatcga ccgcccacgc cttagcatca 

451 cacggcattt ccgttaccgt attggaagcc cgaaaagccg ctcaagccgc 

501 cagcggcaac cggcaagggc tgctttacgc caaaatctcg ccgcacgaca 

551 ccggacagac cgaactgctg cttgccggct acggctacac caaacgcctg 

601 ctcggacaca tcctgcccga ctccgacact tggggcggca acggcatcat 

651 ccacctcaat tacagccgca ccgaacaaca acgcaatcac gaattgggtt 

701 tgcaaaaaca ccataaccac ctctaccgca gcatcacgtc tgcagaagcc 

751 gaaaaaatcg ccggcatccc gctgaacacg ccctacgccg aaccattatg 

801 cggactctac tggcaacacg gcgtatggct caatccgccc gcattcgtcc 

851 gcaccctcct cagccatccg ctgatcgaac tatatgaaaa cacaacgtta 

901 accggcattt cccacgacgg agaaaagtgg attgcaagca cgccaaacgg 

951 cacatttacc gccacacaca tcatctactg caccggcgcg cacagcccct 

1001 gcctgcccga aaccaacctc gccgccctac ccctcaggca aatacgcgga 

1051 caaaccggcc tcacaccgtc caccccgttt tccgaacaac tgcgttgcgc 

1101 cgtttcaggc gaaagctaca tcagcccgtc gtggcacgga ctgcactgct 

1151 acggcgcgag ttttattccc aacagcagca ataccggatg gaacgaagcc 

1201 gaagaagcct caaaccgcca agcattggca caccttaacc ccgcccttgc 

1251 cgaatcattg ttt... 

This corresponds to the amino acid sequence <SEQ ID 734; ORF 212.ng>: 

g212.pep (partial) 

1 MDNLVWDGIP DIRTLDQTIR KHAHPLNLIV CLPDNQIPDF QTAQDASDSE 

51 CRLKHRLDQA TQCLQFDSIN LIEHILPDVR FWLVPPSRTR RLHEHFHHIS 

101 WQTEAIPQTE SKSDKPWFAL PQTSERKKPE HVLVIGAGIA GASTAHAIA S 

151 HGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTGQTELL LiAGYGYTKRL 

201 LGHILPDSDT WGGNGIIHLN YSRTEQQRl^ ELGLQKHHNH LYRSITSAEA 

251 EKIAGIPLNT PYAEPLCGLY WQHGVWLNPP AFVRTLLSHP LIELYENTTL 

301 TGISHDGEKW lASTPNGTFT ATHIIYCTGA HSPCLPETNL AALPLRQIRG 

351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSNTGWNEA 

401 EEASNRQALA HLNPALAESL F. . . 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 735>: 

m212 .seq 

1 ATGGACAATC TCGTATGG6A CGGCATTCCC GACATCCGCA CACTCGACCA 

51 AGCCATCCGC AAACACGCAC CCCCGCTCAA CCTGATTATC TGCCTCCCCG 

101 ATAATCAGAT TCCCGATTTT CAAACCGCAC AAGATGCTTC GGACGCGGAA 

151 TGCCGTCTGA AGCACCGTTT GGATCAGGCA ATGCAGTGCC TCCAGTTCGA 

201 CAGCATCAAC CTGATCGAAC ACATCCTGCC CGATGTCCGC TTCTGGCTGG 

251 TTCCCCCTTC ACGCACCCAC CACCTGCACG AACATTTCCA CCACATTTCC 

301 TGGCAGACCG AAGCCATCCC GCAAACCGAA AGCAAGCCCG ACAAACCCTG 

351 GTTTGCACTT CCACAAACAT CCGAACGGCA AAAACCGGAA CACATCCTCG 

401 TTATCGGCGC GGGCATATCC GGCGCGGCAA CCGCCCACGC CTTAGCATCA 

451 CACGGCATTT CCGTTACCGT ATTGGAAGCC CGAAAAGCCG CCCAAGCCGC 

501 CAGCGGCAAC CGGCAAGGGC TGCTCTACGC CAAAATCTCG CCGCACGACA 

551 CCGAACAGAC CGAACTTTTG CTTGCCGGCT ACGGCTACAC CAAACGCCTG 

601 CTCGGACACA TCCTGCCCGA ATCCGAAACC TGGGGCGGCA ACGGCATCAT 

651 CCACCTCAAT TACAGCCGCA CCGAACAACA ACGCAATCAC GAATTGGGTT 

701 TGCAAAAACA CCATAACCAC CTCTACCGCA GCATCACATC TGCAGAAGCC 

751 GAAAAAATCG CCGGTATCCC ACTGTCCGTC CCATACGACC ACCCTTCATG 

801 CGGACTCTAC TGGCAACACG GCGTATGGCT CAATCCACCC GCATTCGTCC 

851 GCACCCTCCT CAACCATCCG CTCATTGGAC TACACGAAGA CACACCCTTG 

901 ACCGACATTT CCCACGACGG GGaAAAGTGG ATTGCAAGCA CGCCAAACGG 

951 CACATTTACC GCCACACACA TCATCTACTG CACCGGTGCG AACAGCCCCT 

1001 ACCTACCCGA AACCAACCTC GCCGCCCTGC CTCTCAGGCA AATACGCGGA 

1051 CAAACCGGCC TCACACCGTC CACCCCGTTT TCCGAACAAC TGCGTTGCGC 

1101 CGTTTCAGGC GAAAGCTACA TCAGCCCGTC GTGGCACGGA CTGCACTGCT 

1151 ACGGCGCGAG TTTTATTCCC AACAGCAGCC ATACCGGATG GAACGAAGCC 

1201 GAAGAAGCCT CAAACCGCCA AGCATTGGCA CACCTTAACC CCGCCCTTTC 

1251 CGAATCATTG TTTGCCGCCA ACCCAAACCC CCAAAAACAC CAAGGGCACG 

1301 CCGCCATACG CTGCGACAGC CCCGACCACC TTCCCCTAGT CGGCGCACTC 

1351 GGCGACATTG CCGCCATGCG GCAGACCTAC ACCAAACTCG CGCTGGACAA 

1401 AAACTACCGC ATCGACACCC CATGCCCATA CCTGCCTAAT GCCTACGTCA 

1451 ACACCGCGCA CGGCACCCGC GGACTCGCCA CCGCCCCCAT CTGCGCCGCC 
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1501 GmCAwTGCAG CCCAAATCsT AGGCyTGCCC CATCCCTTTT yAcAAcGCCT 
1551 gCGCCACXSCC cTAcACCCCA ACCGCACCAT CATCCGCGCC ATCGTCAGAA 
1601 GGAAGQATCT AACCCCTTAA 

This corresponds to the amino acid sequence <SEQ ID 736; ORF 212>: 

m212 .pep 

1 MDNLVWDGIP DIRTLDQAIR KHAPPLNLII CLPDNQIPDF QTAQDASDAE 

51 CRLKHRLDQA MQCLQFDSIN LIEHILPDVR FWLVPPSRTH HLHEHFHHIS 

101 WQTEAIPQTE SKPDKPWFAL PQTSERQKPE HILVIGAGIS GAATAHAL AS 

151 HGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTEQTELL LAGYGYTKRL 

201 LGHILPESET WGGNGIIHLN YSRTEQQRNH ELGLQKHHNH LYRSITSAEA 

251 EKIAGIPIiSV PYDHPSCGLY WQHGVWLNPP AFVRTLLNHP LIGLHEDTPL 

301 TDISHDGEKW lASTPNGTFT ATHIIYCTGA NSPYLPETNL AALPLRQIRG 

351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSHTGWNEA 

401 EEASNRQALA HLNPALSESL FAANPNPQKH QGHAAIRCDS PDHLPLVGAL 

451 GDIAAMRQTY TKLALDKNYR IDTPCPYLPN AYVNTAHGTR GLATAPICAA 

501 XXAAQIXGLP HPFXQRLRHA LHPNRTIIRA IVRRKDLTP* 

Computer analysis of this amino acid sequence gave the following resuUs: 
Homology with a predicted ORF from AT. gonorrhoeae 

ORF 212 shows 92.9% identity over a 421 aa overlap with a predicted ORF (ORF 212.ng) 
from N, gonorrhoeae: 

m212/g212 

10 20 30 40 50 60 

MDNIjVWDGIPDIRTLDQAIRKHAPPLNLIICLPDNQIPDFQTAQDASDAECRLKHRLDQA 

IIIIIIIIIIIIMIIhlllll llllhlllllllllllllllllhlllllllllll 

MDNLVWDGIPDIRTLDQTIRKHAHPLNLIVCLPDNQIPDFQTAQDASDSECRLKHRLDQA 
10 20 30 40 50 60 

70 80 90 100 110 120 

MQCLQFDSINLIEHILPDVRFWLVPPSRTHHLHEHFHHISWQTEAIPOTESKPDKPWFAL 

lllllllllllllllllllllllllll|::|||||||||||||||lillll lllllll 
TQCLQFDSINLIEHILPDVRFWLVPPSRTRRLHEHFHHISWQTEAIPQTESKSDKPWFAL 
70 80 90 100 110 120 

130 140 150 160 170 180 

PQTSERQKPEHILVIGAGISGAATAHALASHGISVTVLEARKAAQAASGNRQGLLYAKIS 

INIII^IIIhllllllhlhlllllllMMIIIIIIIIIMIIIMIMIIIIIII 

PQTSERKKPEHVLVIGAGIAGASTAHALASHGISVTVLEARKAAOAASGNRQGLLYAKIS 
130 140 150 160 170 180 

190 200 210 220 230 240 

PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 

INI llllllllilllllllllllhhillllllllllllllllllllllllllllll 

PHDTG0TELLLAGYGYTKRLLC2IILPDSDTWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 
190 200 210 220 230 240 

250 260 270 280 290 300 

LYRSITSAEAEKIAGIPLSVPYDHPSCGLYWQHGVWLNPPAFVRTLLNHPLIGLHEDTPL 
MIIIIIIIIMIIIII|::|| :| I M I I I I I I I I I I I I I I I I | | : | | | | |:|:| | 
LYRSITSAEAEKIAGIPLNTPYAEPLCGLYWQHGVWLNPPAFVRTLLSHPLIELYENTTL 
250 260 270 280 290 300 

310 320 330 340 350 360 

TDISHDGEKWIASTPNGTFTATHIIYCTGANSPYLPETNLAALPLRQIRGQTGLTPSTPF 

I IIIIMIIIIIIIIIIIIIIIIIIIIIhll IIIIIIIIIIIIIIIIMIIIIIIII 

TGISHDGEKWIASTPNGTFTATHIIYCTGAHSPCLPETNLAALPLRQIRGQTGLTPSTPF 
310 320 330 340 350 360 

370 380 390 400 410 420 

SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 



m212 .pep 
g212 

m2l2 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
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llllllllllllllllllllllllllllllllhlllllllllllllllllllllhlll 

g2 1 2 SEQLRCAVSGES YI S PSWHGLHCYGAS FI PNSSNTGWNEAEEASNRQALAHLNPALAESL 

370 380 390 400 410 420 

430 440 450 460 470 480 

m212.pep FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMRQTyTKLALDKNYRIDTPCPYLPN 
I 

g212 F 



AAA 








I 








51 




AAAPAPftPAP 


APPPCPTPA A 


101 




"PPP P A A T T 


PA A APPPPAP 


151 




A(^PAPPf2TTT 




201 




PTf5ATTf3AAP 


APATPPTPPP 


251 








301 




A Af^PPATPPP 


PPA A BPPP A a 


351 




PPaPA A APAT 




401 




^sV^o^ril/i J. V^L> 


(j^jt- o^^LjVjL. Aft 


451 




PPfSTTAPPf^T 


ATTPP A aPPP 


501 


rAr;pf^pAAP 






551 


CCGAACAAAC 


CGAACTGCTG 


CTTGCCGGCT 


601 


CTCGGACATA 


TCCTGCCCGA 


ATCCGAAACC 


651 


CCACCTCAAT 


TACAGCCGCA 


CCGAACAACA 


701 


TGCAAAAACA 


CCATAACCAC 


CTCTACCGCA 


751 


GAAAAAATCG 


CCGGCATCCC 


TCTGAACACG 


801 


CGGACTGTTT 


TGGCAGTACG 


GCGTATGGCT 


851 


GCGCCCTCCT 


CAGCCATCCG 


CTCATTGGAC 


901 


ACCGACATTT 


CCCACGACGG 


GGAAAAGTGG 


951 


CACATTTACC 


GCCACACACA 


TCATCTACTG 


1001 


ACCTACCCGA 


AACCAACCTC 


GCCACCCTGC 


1051 


CAAACCGGCC 


TCACACCGTC 


CACCCCGTTT 


1101 


CGTTTCAGGC 


GAAAGCTACA 


TCAGCCCGTC 


1151 


ACGGCGCGAG 


TTTTATTCCC 


AACAGCAGCC 


1201 


GAAGAAGCCT 


CAAACCGCCA 


AGCATTGGCA 


1251 


CGAATCATTG 


TTTGCCGCCA 


ACCCAAACCC 


1301 


CCGCCATACG 


CTGCGACAGC 


CCCGACCACC 


1351 


GGCGACATTG 


CCGCTATGCA 


ACAAACTTAC 


1401 


AAACTATCGC 


ATCGATGCCC 


CCTGCCCGTA 


1451 


ACACCGCCCA 


CGGCACACGC 


GGGCTTGCCA 


1501 


GCCGTTGCAG 


CCGAAATCCT 


AGGCTTGCCC 


1551 


GCGCCACGCC 


CTACACCCCA 


ACCGCGCCAT 


1601 


GGAAGGATCT 


AACCCCTTAA 





CCCTCAGGCA AATACGCGGA 



CATCCCCTCT CAAAACGCCT 
CATCCGCGCC ATCGTCAGAA 

This corresponds to the amino acid sequence <SEQ ID 738; ORF 212.a>: 

a212.pep 

1 MDNLAWNGIP DIRTLDQTIR KHAHPLNLIV CLPDNQIPNF QTAQDASDAE 

51 CRLKHRLDQA TQCLQFDSIN LIEHILPDVR FWLVPPSRTR RLHEHFHHIS 

101 WQTEAIPQTE SKPDKPWFAL PQTSERQKPE H ILVIGAGIS GAATAHAL AS 

151 YGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTEQTELL LAGYGYTKRL 

201 LGHILPESET WGGNGIIHLN YSRTEQQRNH ELGLQKHHNH LYRSITQAEA 

251 EKIAGIFLNT PYAEPLCGLF WQYGVWLNPP TFVRALLSHP LIGLHEDTPL 

301 TDISHDGEKW lASTPNGTFT ATHIIYCTGA NSPYLPETNL ATLPLRQIRG 

351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSHTGWNEA 

4 01 EEASNRQALA HLNPALSESL FAANPNPQKH QGHAAIRCDS PDHLPLVGAL 

451 GDIAAMQQTY AKLALDKNYR IDAPCPYLPN AYANTAHGTR G LATAPICAA 

501 AVAAEILGLP HPLSKRLRHA LHPNRAIIRA IVRRKDLTP* 

m212/a212 93.7% identity in 539 aa overlap 

10 20 30 40 50 60 

ra212.pep MDNLVWDGIPDIRTLDQAIRKHAPPLNLIICLPDNQIPDFQTAQDASDAECRLKHRLDQA 

lill:IHII>IIMII: 111:111 : Illllilllll 

a212 MDNLAWNGIPDIRTLDQTIRKHAHPLNLIVCLPDNQIPNFQTAQDASDAECRLKHRLDQA 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m212 .pep MQCLQFDSINLIEHILPDVRFWLVPPSRTHHLHEHFHHISWQTEAIPQTESKPDKPWFAL 
lllllll)llllllllllllllllll)l::||ltMIIIIIIIIIIIIIIIIIIIIIII 
a212 TQCLQFDSINLIEHILPDVRFWLVPPSRTRRLHEHFHHISWQTEAIPQTESKPDKPWFAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m212 .pep PQTSERQKPEHILVIGAGISGAATAHALASHGISVTVLEARKAAQAASGNRQGLLYAKIS 

IIIIIIIIIMIMIIIMIIIIIIIIIICtI IIIIIIIMIIIIIII 

a212 PQTSERQKPEHILVIGAGISGAATAHAIiASYGISVTVLEARKAAQAASGNRQGLLYAKIS 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2l2 .pep PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 

111 I till 1 1 nil III 1 11)11 III II mil Ml II I Mil nil Ml II 1 11 1 II II 

a 2 1 2 PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGI IHLNYSRTEQQRNHELGLQKHHNH 

190 200 210 220 230 240 

250 260 270 280 290 300 

m212 . pep LYRSITSAEAEKIAGIPLSVPYDHPSCGLYWQHGVWLHPPAFVRTLLNHPLIGLHEDTPL 
llllll:IIIIIMMM::ll :l I II : II : II I n I I : II I : I I : II M I II I II II 
a212 LYRSITQAEAEKIAGIPLNTPYAEPLCGLFWQYGVWLNPPTFVRALLSHPLIGLHEDTPL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m212.pep TDISHDGEKWIASTPNGTFTATHriYCTGANSPYLPETNLAALPLRQIRGQTGLTPSTPF 
IIIIIIMIIIIIIMIIIIIMIIMMIIMIIIIMIIMIMIIIIIII IIMMI 
a212 TDISHDGEKWIASTPNGTFTATHIIYCTGANSPYLPETNLATLPLRQIRGQTGLTPSTPF 

310 320 330 340 350 360 

370 380 390 400 410 420 

m212.pep SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 
n M M II I I I I I I I I I I II I I M M I II M M I I I II I II M I II I I I II I II I I I I II 
a2 1 2 SEQLRCAVSGES YI S PSWHGLHC YGAS FI PNSSHTGWNEAEEASNRQALAHLNPALSESL 

370 380 390 400 410 420 

430 440 450 460 470 480 

m212.pep FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMRQTYTKLALDKNYRIDTPCPYLPN 
I M I I I I M I I I 1 .1 I I I It I II I II I I II I I II I M : I M : I II I n I I I M : II I I I II 
a212 FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMQQTYAKLALDKNYRIDAPCPYLPN 

430 440 450 460 470 480 

490 500 510 520 530 540 

m212 .pep AYVNTAHGTRGLATAPICAAXXAAQIXGLPHPFXQRLRHALHPNRTIIRAIVRRKDLTPX 
IMIMMIMnilMIM ||:| |||M: : II I I I I I M I : II I II I II I I I II t 
a212 AYANTAHGTRGLATAPICAAAVAAEILGLPHPLSKRLRHALHPNRAIIRAIVRRKDLTPX 

490 500 510 520 530 540 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 739>: 

g214 . seq 

1 atgatacaaa agatatgtaa gctatttgtt ttaattgtaa tttttgcaac 

51 ttctcccgct tttgcccttc aaagcgacag cagacggccc atccaaatcg 

101 aagccgacca aggttcgctc gatcaagcca accaaaggac cacatttagc 

151 ggcaatgtca tcatcagaca gggtacgctc aacatttccg cctcgtgtgt 

201 caacgtcaca cgcggcaggc aaaggcggcg aatccgtgag ggcggaaggt 

251 tcgcccgtcc gcttcagcca aacgttggac gggggcaaag ggacggtgcg 

301 cggtcaggca aacaacgtta cctattcctc cgcaggaagc actgtcgttc 

351 tgaccggcaa tgccaaagtg cagcgcggcg gcgacgttgc cgaaggtgcg 

401 gtcattacct acaacaccaa aaccgaagtc tataccatca acggcagcac 

451 gaaatcgggt gcgaaatccg cttccaaaac cggcagggtc agcgtcgtca 

501 tccagccttc aagcacacaa aaaaccgaat aaccccgatg ccgtctgaaa 

551 cggaaacgca gttcagacgg catttgccga ccgaaatgcc gagaagagat 

601 tattga 

This corresponds to the amino acid sequence <SEQ ID 740; ORF 214.ng>: 

g214.pep 
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1 MIQKICKLFV LIVIFATSPA FA LQSDSRRP IQIEADQGSL DQANQRTTFS 
51 GNVIIRQGTL NISASCVNVT RGRQRRRIRE GGRFARPLQP NVGRGQRDGA 
101 RSGKQRYLFL RRKHCRSDRQ CQSAARRRRC RRCGHYLQHQ NRSLYHQRQH 
151 EIGCEIRFQN RQGQRRHPAF KHTKNRITPM PSETETQFRR HLPTEMPRRD 
201 Y 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 741>: 

m214.seq (partial) 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

201 CAATGTTACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCAGTCCG CTTCAGCCAG ACATTGGACG GCGGCAAAGG CACGGTGCGC 

301 GGACAGGCAA ACAACGTTGC TTATTCATCT GCAGGCAGCA CCGTAGTCTT 

3 51 AACCGGTAAT GCCAAAGTAC AGCGCGGCGG CGATGTCGCC GAAGGTGCGG 

401 TGATTACATA CAACACCAAA ACCGAAGTCT ATACCATCAG CGGCAGCACA 

451 AAATT... 

This corresponds to the amino acid sequence <SEQ ID 742; ORF 214>: 

ni214.pep (partial) 

1 MIQKICKLFV LIAFFSASPA FA LQ5DSRQP IQIEADQGSL DQANQSTTFS 
51 GNWIRQGTL NISAARVNVT RGRQRRRIRE GGRFASPLQP DIGRRQRHGA 
101 RTGKQRCLFI CRQHRSLNR* CQSTARRRCR RRCGDYIQHQ NRSLYHQRQH 
151 KI... 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 214 shows 80.3% identity over a 152 aa overlap with a predicted ORF (ORF 214.ng) 
fix>m N, gonorrhoeae: 

m214/g214 

10 20 30 40 50 60 

m214.pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 

| | | | | | | : | | | | | | 

g214 MIQKICKLFVLIVIFATSPAFALQSDSRRPIQIEADQGSLDQANQRTTFSGNVIIRQGTL 
10 20 30 40 50 60 

70 80 90 100 110 120 

m214.pep NISAARVNVTRGRQRRRIREGGRFASPLQPDIGRRQRHGARTGKQRCLFICRQHRSLNRX 

1111= IIIMIMIIIIIIIIIII ||||::|| II llhllll II: hi :| 
92 1 4 NI S AS CVNVTRGRQRRRI REGGRFARPLQPNVGRGQRDGARSGKQRYLFLRRKHCRSDRQ 

70 80 90 100 110 120 

130 140 150 

m2 14 . pep CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKI 

llhllll MM hllllllllllllhl 

9214 CQSAARRRRCRRCGHYLQHQNRSLYHQRQHEIGCEIRFQNRQGQRRHPAFKHTKNRITPM 
130 140 150 160 170 180 

g2 14 PSETETQFRRHLPTEMPRRDY 
190 200 

The following partial DNA sequence was identified in A'', meningitidis <SEQ ID 743>: 

a214 . seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

201 CAATGTTACA CGCGGC.GGC AAAGGCGGCG AATCCGTGAG GGCGGAAGGT 

251 TCGCCAGTCC GCTTCAGCCA GACATTGGAC GGCGGCAAAG GCACGGTGCG 

301 CGGACAGGCA AACAACGTTG CTTATTCATC TGCAGGCAGC ACCGTAGTCT 
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351 TAACCGGTAA 

4 01 GTGATTACAT 

451 AJ^AATCCGGC 

501 TCCAGCCTTC 

551. CATAAACCTG 

601 TTATGA 



TGCCAAAGTA 
ACAACACCAA 
GCAAAATCCG 
GAGTACGCAA 
GTTCGGACGG 
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CAGCGCGGCG 
AACCGAAGTC 
CTTCCAAATC 
AAATCCGAAT 
CATTTGCCGA 



GCGATGTCGC 
TATACCATCA 
CGGCAGGGTC 
AATCCCAATG 
CCGAAATATT 



CGAAGGTGCG 
GCGGCAGCAC 
AGCGTCGTTA 
CCGTCTGAAA 
GAAGAGATAT 



This corresponds to the amino acid sequence <SEQ ID 744; ORF 214.a>: 

a214 .pep 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 

51 GNWIRQGTL NISAARVNVT RGXQRRRIRE GGRFASPLQP DIGRRQRHGA 

101 RTGKQRCLFI CRQHRSLNR* CQSTARRRCR RRCGDYIQHQ NRSLYHQRQH 

151 KIRRKIRFQI RQGQRRYPAF EYAKIRIIPM PSET*TWFGR HLPTEILKRY 

201 L* 



ni2 14/a2 1 4 99.3% identity in 1 52 aa overlap 

10 20 



30 



40 



50 



60 



m214 . pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 
I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 
a214 MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 
10 20 30 40 50 60 



70 80 90 100 110 120 

m214 . pep NISAARVNVTRGRQRRRIREGGRFASPLQPDIGRRQRHGARTGKQRCLFICRQHRSLNRX 
I I I I I I I I I I I I I I I I I I I t I I I I I I M I I I I 1 I i ) i I I I I I I I I I I I I I I I I I I I I I I 
a 2 1 4 N I S AARVNVTRGXQRRR IREGGRFAS PLQP D IGRRQRHGARTGKQRCLFI CRQHRSLNRX 

70 80 90 100 110 120 

130 140 150 

m214 . pep CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKI 
I I I t I I I I I I I I I I t I I I I M I I I I I I I I I I I 
a214 CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKIRRKIRFQIRQGQRRYPAFEYAKIRIIPM 

130 140 150 160 170 180 

a214 PSETXTWFGRHLPTEILKRYLX 
190 200 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 745>: 

g214-1.8eq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATTGTAA TTTTTGCAAC 

51 TTCTCCCGCT TTTGCCCTTC AAAGCGACAG CAGACGGCCC ATCCAAATCG 

101 AAGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGTAC CACATTTAGC 

151 GGCAATGTCA TCATCAGACA GGGTACGCTC AACATTTCCG CCTCGCGCGT 

201 CAACGTCACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

25X CGCCCGTCCG CTTCAGCCAA ACGTTGGACG GGGGCAAAGG GACGGTGCGC 

301 GGTCAGGCAA ACAACGTTAC CTATTCCTCC GCAGGAAGCA CCGTCGTTCT 

351 GACCGGCAAT GCCAAAGTGC AGCGCGGCGG CGACGTTGCC GAAGGTGCGG 

401 TCATTACCTA CAACACCAAA ACCGAAGTCT ATACCATCAA CGGCAGCACG 

451 AAATCGGGTG CGAAATCCGC TTCCAAAACC GGCAGGGTCA GCGTCGTCAT 

501 CCAGCCTTCA AGCACACAAA AAACCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 746; ORF 214-1. ng>: 

g214-l.pap 

1 MIQKICKLFV LIVIFATSPA FA LQSDSRRP IQIEADQGSL DQANQSTTFS 
51 GNVIIRQGTL NISASRVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 
101 GQANNVTYSS AGSTWLTGN AKVQRGGDVA EGAVITYNTK TEVYTINGST 
151 KSGAKSASKT GRVSWIQPS STQKTE* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 747>: 

m214-l. seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 
51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 
101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 
151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 
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201 CAATGTTACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCAGTCCG CTTCAGCCAG ACATTGGACG GCGGCAAAGG CACGGTGCGC 

301 GGACAGGCAA ACAACGTTGC TTATTCATCT GCAGGCAGCA CCGTAGTCTT 

351 AACCGGTAAT GCCAAAGTAC AGCGCGGCGG CGATGTCGCC GAAGGTGCGG 

401 TGATTACATA CAACACCAAA ACCGAAGTCT ATACCATCAG CGGCAGCACA 

451 AAATCCGGCG CAAAATCCGC TTCCAAATCC GGCAGGGTCA GCGTCGTTAT 

501 CCAGCCTTCG AGTACGCAAA AATCCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 748; ORF 214-1>: 

in214-l .pep 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 

51 GNWIRQGTL NISAARVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 

101 GQANNVAYSS AGSTVVLTGN AKVQRGGDVA EGAVITYNTK TEVYTISGST 

151 KSGAKSASKS GRVSWIQPS STQKSE* 



m214-l/g214-l 93.8% identity in 176 aa pverlap 

10 20 30 40 50 60 

in2 14-1 .pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 

Ilillll::)::illltlillll:||lillll|| ) I I I I I I I I : I I I I i I 

g21 4 - 1 MIQKICKLFVLI VIFATSPAFALQSDSRRPIQIEADQGSLDQANQSTTFSGNVIIRQGTL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ra21 4 - 1 . pep NISAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSS AGSTVVLTGN 
II MHIIIIMIMIIIMIIMI III llllllilll mill 11:111111 III Mil 
g2 1 4 - 1 NX S AS RVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVTYSS AGSTVVLTGN 

70 80 90 100 110 120 

130 140 150 160 170 

m2 1 4 - 1 . pep AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVS WIQPSSTQKSEX 

I M I N I t I I I I I I I I I I I I I : I II II I I II M I : I I I I I I : I I 

g214-l AKVQRGGDVAEGAVITYNTKTEVYTINGSTKSGAKSASKTGRVSWIQPSSTQKTEX 

130 140 150 160 170 

g214-l/p3B685 

sp|P38685|YHBN_ECOLI 17.3 KD PROTEIN IN MURA-RPON INTERGENIC REGION PRECURSOR (ORF185) 
>gil551336 (012684) orfl85 [Escherichia coli} >gi|606139 (U18997) ORF_ol85 (Escherichia coli] 
>gi 11789592 {AE000399) orf, hypothetical protein [Escherichia coli] Length - 185 
Score 97.1 bits (238), Expect = 6e-20 

Identities = 57/126 (45%), Positives = 74/126 (58%), Gaps « 3/126 (2%) 

Query: 19 PAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTLNISAARVNVTR—GGKGG 7 6 

PAFA+ D+ QPI lE+DQ SLD TF+GNV++ QGT+ I+A +V VTR G +G 

Sbjct: 2 4 PAFAVTGDTDQPIHIESDQQSLDMQGNVVTFTGNVIVTQGTIKINADKWVTRPGGEQGK 83 

Query: 77 ESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSSAGSTVVLTGNAKVQRGGDVAEGAVIT 136 

E+ GP FQ DGK VGA+ + Y A WLTGNA +Q+ +G IT 

Sbjct: 84 EVIDGYGKPATFYQMQDNGK-PVEGHASQMHYELAKDFWLTGNAYLQQVDSNIKGDKIT 142 

Query: 137 YNTKTE 142 

Y K + 
Sbjct: 143 YLVKEQ 148 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 749>: 

a214-1.8aq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

201 CAATGTTACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCAGTCCG CTTCAGCCAG ACATTGGACG GCGGCAAAGG CACGGTGCGC 

301 GGACAGGCAA ACAACGTTGC TTATTCATCT GCAGGCAGCA CCGTAGTCTT 

351 AACCGGTAAT GCCAAAGTAC AGCGCGGCGG CGATGTCGCC GAAGGTGCGG 

401 TGATTACATA CAACACCAAA ACCGAAGTCT ATACCATCAG CGGCAGCACA 

451 AAATCCGGCG CAAAATCCGC TTCCAAATCC GGCAGGGTCA GCGTCGTTAT 

501 CCAGCCTTCG AGTACGCAAA AATCCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 750; ORF 214-1. a>: 

a214-l.pap 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
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51 GNVVIRQGTL NISAARVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 
101 GQANNVAYSS AGSTWLTGN AKVQRGGDVA EGAVITYNTK TEVYTISGST 
151 KSGAKSASKS GRVSWIQPS STQKSE* 

a214-l/iii214-l 100.0% identity in 176 aa overlap 

10 20 30 40 50 60 

a2 14-1, pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 
llltllllllllllllMllllllllillMIIIIIIIMIMIittllllllllMIM 
m214-l MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 

10 20 30 40 50 60 

70 80 90 100 110 120 

a2 14-1. pep NISAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSS AGSTWLTGN 
llllllllllllltllllMlllltllllllllllllllllllllllllltlllllllll 
m2 1 4 - 1 N I SAARVNVTRGGKGGES VRAEGS PVRFSQTLDGGKGTVRGQANNVAYSS AGSTWLTGN 

70 80 90 100 110 120 

130 140 150 160 170 

a214-l.pep AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVSWIQPSSTQKSEX 

illltltll ItlltltllllltlllllDIM MIIIIIIIIIIIII 

II1214-1 AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVSWIQPSSTQKSEX 

130 140 150 160 170 



g215 . seq 








1 


atgaaagtaa 


gatggcggta 


cggaattgcg 


51 


tgccttgggc 


agcctgtcgg 


catggttggg 


101 


tcgaggaagt 


caggctcaat 


cccgacgaac 


151 


ttggacggaa 


ggcggtttga 


cgaacaggga 


201 


cgcgaaaggt 


gcgaaacagt 


ttcccgaaaa 


251 


cgccgcatct 


cgtgttcttc 


caagaaggca 


301 


agcgatgaag 


ccgtttacca 


taccgaaaac 


351 


caacgttgtg 


ctgaccaaaa 


ccgccgacgg 


401 


tcgaaaccga 


aaaactgcac 


gtcgataccg 


451 


gatacgcctg 


tcagtttcca 


atatggcgcg 


501 


tatgacctac 


aaccacaaaa 


caggcatgtt 


551 


aagccgcgat 


ttatgataca 


aaagatatgt 



This corresponds to the amino acid sequence <SEQ E> 752; ORF 215.ng>: 

g215 .pep 

1 MKVRWRYGIA FPLILAVALG SLSA WLGRIS EVEISEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSAKG AKQFPENSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNW LTKTADGRRQ AGKVETEKLH VDTESQYAQT 

151 DTPVSFQYGA SHGQAGGMTY NHKTGMLNFS SKVKAAIYDT KDM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 753>: 

m215.seq (partial) 

1 . .AGCCTGTCGG CATGGTTGGG TCGTATCAGC GAAGTCGAGA TTGAAGAAGT 

51 CAGGCTCAAT CCCGACGAAC CGCAATACAC AATGGACAGC TTGGACGGCA 

101 GGCGGTTTGA CGAACAGGGA TACTTGAAAG AACATTTGAG CGCGAAGGGC 

151 GCGAAACAGT TTCCGGAAAG CAGCGACATC CATTTTGATT CGCCGCATCT 

201 CGTGTTCTTC CAAGAAGGCA GGTTGTTGTA CGAAGTCGGC AGCGACGAAG 

251 CCGTTTACCA TACCGAAAAC AAACAGGTTC TTTTTAAAAA CAACGTTGTG 

301 CTGACCAAAA CCGCCGACGG CAAACGGCAG GCGGGTAAAG TTGAAGCCGA 

351 AAAGCTGCAC GTCGATACCG AATCTCAATA TGCCCAAACC GATACGCCTG 

401 CAGTTTCCA ATATGGTGCA TCGCACGGTC AGGCGGGCGG CATGACTTAC 

451 GACCACAwwA CAGGCATGTT GAACTTCTCA TCTAAAGTGA AAGCCACGAT 

501 TTATGATACA AAAGATATGT AA 

This corresponds to the amino acid sequence <SEQ ID 754; ORF 21 5>: 

m215.pep (partial) 

1 . . SLSA WLGRIS EVEIEEVRLN PDEPQYTMDS LDGRRFDEQG YLKEHLSAKG 

51 AKQFPESSDI HFDSPHLVFF QEGRLLYEVG SDEAVYHTEN KQVLFKNNW 

101 LTKTADGKRQ AGKVEAEKLH VDTESQYAQT DTPVSFQYGA SHGQAGGMTY 
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151 DHXTGMLNFS SKVKATIYDT KDM* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 215 shows 96.0% identity over a 173 aa overlap with a predicted ORF (ORF 215.ng) 
from K gonorrhoeae: 

m215/g215 

10 20 30 40 

SLSAWLGRISEVEIEEVRLNPDEPQYTMDSLDGRRFDEQG 

IIIIIIIIIIMIIIIIIIIIIIIIIIIhllllllllll 
MKVRWRYGIAFPLILAVALGSLSAWUSRISEVEIEEVRLNPDEPQYTMDGLDGRRFDEQG 
10 20 30 40 50 60 

SO 60 70 80 90 100 

YLKEHLSAKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 

IIIMIIIIIIIIIIhlllllllllllllllllllllllllllMIIIIIIIIIIIIM 

YLKEHLSAKGAKQFPENSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 
70 80 90 100 110 120 



m215 .pep 
g215 

m215.pep 
g215 



110 120 130 140 150 160 

m2 15 . pep LTKTADGKRQAGKVEAEKIiHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHXTGMLNFS 

llllllhllllllhlllllllllllllllllllllllllllllMllhl lllllll 
g2 1 5 LTKTADGRRQAGKVETEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYNHKTGMLNFS 
130 140 150 160 170 180 

170 

m2 1 5 . pep SKVKATI YDTKDMX 

Illlhlllllll 
g 2 1 5 S KVKAAI YDTKDM 

190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 755>: 

a215, seq 

1 ATGAAAGTAA GATGGCGGTA 

51 TGCCTTGGGC AGCCTGTCGG 

101 TTGAAGAAGT CAGGCTCAAT 

151 TTGGATGGCA GGCGGTTTGA 

201 TTCGAAGGGC GCGAAACAGT 

251 CACCGCATCT CGTGTTCTTC 

301 AGCGATGAAG CCGTTTACCA 

351 CAACGTTGTG CTGACCAAAA 

401 TTGAAGCCGA AAAGCTGCAC 

451 GATACGCCTG TCAGTTTCCA 

501 CATGACTTAC GACCACAAAA 

551 AAGCCACGAT TTATGATACA 



CGGAATTGCG TTCCCATTGA TATTGGCGGT 
CATGGTTGGG ACGCATCAGC GAAGTCGAGA 
CCCGACGAAC CGCAATACAC AATGGACGGA 
CGAACAGGGA TACTTGAAAG AACATTTGAG 
TTCCCGAAAG CAGCGACATT CATTTCGACT 
CAAGAAGGCA GGTTGTTGTA CGAAGTCGGC 
TACCGAAAAC AAACAGGTTC TTTTTAAAAA 
CCGCCGACGG CAAACGGCAG GCGGGTAAAG 
GTCGATACCG AATCTCAATA TGCCCAAACC 
ATATGGTGCA TCGCACGGTC AGGCGGGCGG 
CAGGCATGTT GAACTTCTCA TCTAAAGTGA 
7WVGATATGT AA 



This corresponds to the amino acid sequence <SEQ ID 756; ORF 215.a>: 

a215.pep 

1 MKVRWRYGIA FPLILAVALG SLS AWLGRIS EVEIEEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSSKG AKQFPESSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNW LTKTADGKRQ AGKVEAEKLH VDTESQYAQT 

151 DTPVSFQYGA SHGQAGGMTY DHKTGMLNFS SKVKATIYDT KDM* 

m215/a215 983% identity in 173 aa overlap 

10 20 30 40 

10215 . pep SLSAWLGRISEVEIEEVRLNPDEPQYTMDSLDGRRFDEQG 

I i I I I I I I I I I I I I I I I I I I I I I I I I i ! I : I I I I I I I I I I 
a215 MKVRWRYGIAFPLILAVALGSLSAWLGRISEVEIEEVRLNPDEPQYTMDGLDGRRFDEQG 
10 20 30 40 50 60 



50 60 70 80 90 100 

m215.pep YLKEHLSAKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 
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I I I I t I I : I I I I I I I I I t I 1 I i I I I I I M I I I I I I ) I I I I I [ I I I I I I I I I I I I I I I I I I 
a215 YLKEHLSSKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 
70 80 90 100 110 120 

110 120 130 140 150 160 

m2 15 . pep LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHXTGMLNFS 
I I I I t I I I I I I I I I I I t I I I I I I M I I I I I I t I I It I I I I I I I I I I I I I I I I I I I I I I I 
a215 LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHKTGMLNFS 

130 140 150 160 170 180 

170 

m215 . pep SKVKATIYDTKDMX 
I I I I t I I I I I I I I I 
a215 SKVKATIYDTKDMX 
190 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 757>: 

g216.fieq (partial) 

1 . .atgatatcga tttcgagctc ggtacccagc gacgaaatca ccgccatcat 
51 ccccgcactc aaacgcaaag acattaccct cgtctgcatc accgcccgcc 
101 ccgattcaac catggcgcgc catgccgata tccacatcac cgcatcggtt 
151 tcgcaagaag cctgcccgtt ggggcttgcc ccgaccacca gcaccaccgc 
201 cgttatggct ttgggcgacg cgttggcggt cgtcctgctg cgcgcccgcg 
251 cgttcacgcc cgacgacttc gccttgatcc accctgccgg cagcctcggc 
301 aaacgcctgc ttttgcgcgt tgccgacatt atgcacaaag gcggcggcct 
351 gcccgccgtc cgactcggca cgcccttgaa aggagccatc gtcagcatga 
401 gcgagaaagg tttgggcatg tgggcgggaa cggacgggca aaggctgtct 
451 gaaaggcctt tttactga 

This corresponds to the amino acid sequence <SEQ ID 758; ORF 216.ng>: 

g216 .pep (partial) 

1 . .MISISSSVPS DEITAIIPAL KRKDITLVCI TARPDSTMAR HADIHITASV 
51 SQEACPLGLA PT TSTTAVMA LGDALAWLL RARAFTPDDF ALIHPAGSLG 
101 KRLLjLRVADI MHKGGGLPAV RLGTPLKGAI VSMSEKGLGM WAGTDGQRLS 
151 ERPFY* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 759>: 

m216 . seq 

1 ATGGCAATGG CAGAAAACGG AAAATATCTC GACTGGGCAC GCGAAGTGTT 

51 GCACGCCGAA GCGGAAGGCT TGCGCGAAAT TGCAGCGGAA ttgsacaaaa 

101 acttcgtcct tgcggcagac gcgttgttgc actgcaaggg cagggtcgtt 

151 atcacgggca tggtcaagtc gggacatatc gggcgcaaaa tggcggcaac 

201 tatggcctcg accggcacgc ctgcgttttt cgtccaccct gcggaagcgg 

251 CACACGgCGA TTTGGGTATG ATTGTGGACA rCGACGTGGT CGTCGCGATT 

301 tccaattccg gcgaaagcga cgaaatcgcc gccatcatcc CCGCACTCAA 

351 ACGCAAAGAC atcaogcttg TCTGCATCAC CGCCCGCCCC gattcaacca 

401 tggcgcgcca tgccgacatc cacatcacgg cgtcggtttc caaagaagcc 

451 tgcccgctgg ggcttgcccc gaccaccagc accaccgccg tcatggcttt 

501 GGGCGATGCG TTGGCGGTCG TCCtGCTGCG CgcACGCGCG TTCACGCCCG 

551 ACGATTTCGC CTTGAGCCAT CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 

601 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 

651 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

701 TGGGCATGTT GGCGGTAACG GACGGGCAAG GCCGTCTGAA AGGCGTATTC 

751 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 

801 TCTTTCGATA GACGAAGTCA TGCATACGCA TCCTAAAACC ATCTCCGCCG 

851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 

901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG CGCTGAATAT 

951 GCACGACCTG CTGGCGGCAC GGATTGTATA G 

This corresponds to the amino acid sequence <SEQ ID 760; ORF 216>: 
m2l6 .pep 

1 MAMAENGKYL DWAREVLHAE AEGLREIAAE LXKNFVLAAD ALLHCKGRW 
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51 ITGMVKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDXDVWAI 
101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVSKEA 
151 CPLGLAPTT S TTAVMALGDA liAWLL RARA FTPDDFALSH PAGSLGKRLL 
201 LRVADIMHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 
251 TDGDLRRLFQ ECDNFTGLSI DEVMHTHPKT ISAERLATEA LKVMQANHVN 
301 GLLVTDADGV LIGALNMHDL LAARIV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 216 shows 91.8% identity over a 147 aa overlap with a predicted ORF (ORF 216.ng) 
from K gonorrhoeae: 

m216/g216 

70 80 90 100 110 120 

tmastgtpaffvhpaeaahgdlgmivdxdvwaisnsgesdeiaaiipalkrkditlvci 

•'-■•'\\--\ lllhlllillllllllllll 

MIS ISS S VPSDE I TAXI PALKRKDI TLVC I 
10 20 30 

130 140 150 160 170 180 

tarpdstmarhadihitasvskeacplglapttsttavmalgdalawllraraftpddf 

llllllllllllllllllllhlllllllllllllllllllllllllllllMIIIIIII 
tarpdstmarhadihitasvsqeacplglapttsttavmalgdalawllraraftpddf 

40 50 60 70 80 90 

190 200 210 220 230 240 

alshpagslgkrlllrvadimhkggglpavrlgtplkeaivsmsekglgmlavtdgqgrl 
II llllllllllllllllllllllllllllllllll llllllllllll I INI 
alihpagslgkrlllrvadimhkggglpavrlgtplkgaivsmsekglgmwagtdgqrls 

100 110 120 130 140 150 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 761 >: 

a216.seq 

1 ATGGCGATGG CAGGAAACGA AAAATATCTT GATTGGGCAC GCGAAGTGTT 

51 GCACACCGAA GCGGAAGGCT TGCGCGAAAT TGCGGCGGAT TTGGACGAAA 

101 ACTTCGCCCT TGCGGCGGAC GCGTTGTTGC ACTGCAAAGG CAGGGTCGTT 

151 ATCACGGGCA TGGGCAAGTC GGGACATATC GGGCGCAAAA TGGCGGCAAC 

201 CATGGCCTCG ACCGGCACGC CCGCGTTTTT CGTCCACCCT GCGGAAGCGG 

251 CACACGGCGA TTTGGGCATG ATTGTGGACA ACGACGTGGT CGTCGCGATT 

301 TCCAATTCCG GTGAAAGCGA CGAAATCGCC GCCATCATCC CCGCGCTCAA 

351 ACGCAAAGAT ATCACGCTTG TCTGCATCAC CGCCCGCCCC GATTCAACCA 

401 TGGCGCGCCA TGCCGACATC CACATCACGG CGTCGGTTTC CAAAGAAGCC 

451 TGCCCGCTGG GGCTTGCCCC GACCACCAGC ACCACCGCCG TTATGGCTTT 

501 GGGCGATGCG TTGGCGGTTG TCCTGCTGCG CGCCCGCGCG TTCACGCCCG 

551 ACGACTTCGC CTTGAGCCAC CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 

601 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 

651 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

701 TGGGCATGTT GGCGGTAACG GACGGGCAAG GCCGTCTGAA AGGCGTATTC 

751 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 

801 TCTTTCGATA GACGAAGTCA TGCATACGCA TCCTAAAACC ATCTCCGCCG 

851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 

901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG CGCTG/^TAT 

951 GCACGACCTT TTGGCGGCGC GGATTGTATA G 

This corresponds to the amino acid sequence <SEQ ID 762; ORF 216.a>: 

a216.pep 

1 MAMAGNEKYL DWAREVLHTE AEGLREIAAD LDENFALAAD ALLHCKGRW 

51 ITGMGKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDNDVWAI 

101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVSKEA 

151 CPLGLAPTT S TTAVMALGDA LAWLL RARA FTPDDFALSH PAGSLGKRLL 

201 LRVADIMHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 

251 TDGDLRRLFQ ECDNFTGLSI DEVMHTHPKT ISAERLATEA LKVMQANHVN 



m216 .pep 
g216 

m2 1 6 . pep 
g216 

m216 .pep 
g216 
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301 GLLVTDADGV LIGALNMHDL LAARIV* 

ml 1 6/a2 1 6 97.2% identity in 326 aa overlap 

10 20 30 40 50 60 

m216.pep MAMAENGKYLDWAREVLHAEAEGLREIAAELXKNFVLAADALLHCKGRVVITGMVKSGHI 
Mill lllllllllll:lllllltlll:| :M:Mlllllllllllllltl Mill 
a216 MAMAGNEKYLDWAREVLHTEAEGLREIAADLDENFALAADALLHCKGRWITGMGKSGHI 

10 20 30 40 50 60 

70 80 90 100 110 120 

m21 6 . pep GRKblAATMASTGTPAFFVHPAEAAHGDLGMIVDXDVWAISNSGESDEIAAIIPALKRKD 
I I I I I II I II I II II II I i II II I ) II I I II II i I I I I I I I I I I I II I I I I I I I I I I I i 
a 2 1 6 GRKMAATMASTGT PAFFVH PAE AAHGDLGMI VDNDVWAI SNSGE S DE I AAI I PALKRKD 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 1 6 . pep ITLVCITARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAWLLRARA 
I I I I I I I I I I I I I I I II I i I I I I I II I I I I I 1 I I I I I I I II I I I I I I I I I I I I I I I I I I I 
a216 ITLVCITARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAWLLRARA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m21 6 . pep FTPDDFALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAIVSMSEKGLGMLAVT 

I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I II I I I I I I I I I I I I I I 
a216 FTPDDFALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAIVSMSEKGLGMLAVT 

190 200 210 220 230 240 

250 260 270 280 290 300 

m216 . pep DGQGRLKGVFTDGDLRRLFQECDNFTGLSIDEVMHTHPKTISAERLATEALKVMQANHVN 

I I i 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II j 1 1 i 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 

a216 DGQGRLKGVFTDGDLRRLFQECDNFTGLSIDEVMHTHPKTISAERLATEALKVMQANHVN 
250 260 270 280 290 300 

310 320 
m2 1 6 , pep GLLVTDADGVLIGALNMHDLLAARIVX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a21 6 GLLVTDADGVLIGALNMHDLLAARIVX 

310 320 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 763>: 

g217. seq 

1 atggcggatg acggtttgtt gcggcaactg tccgaaaaac ccagccaaag 

51 tgctctcttc ctgccatttg acccattcgt tttcgaggtt ttggactgcc 

101 ttttggtcat cgggcccggc ttgaaacaat gtttcaagca aatcccggca 

151 acgcgccacc cattcgccga ccgtcgcagg ttgccgccat atccgggcaa 

201 tatccgacag ggtttcgagg aaggcggcaa aacgtccgaa catggcggtt 

251 tgattcacgt cggcatacca cgcgctgaca tcctgccaca tcgggttgcc 

301 gccttcgggc agcatccagc ccaatatcat acggtctgcc gcctgcttcc 

351 aggtaaacag ctgatccgtg ccgccgcgca tttctccgtc caatccccaa 

401 tggacgttca aatcggcaac catatcgtgc aaaagcggca aatcgtcccc 

451 ggtcagtccg aaacggcgca acacgggcgc ggtttccaaa agcgcgagca 

501 ctttgccgac ttcaaaacgg ctttccagca agtcggacac gcactccaac 

551 gcataaaaaa acggttgccg gcggctgatt ttcacgtccg aaacggaata 

601 cggcaatgcc tgcgcgccgg gttgcgcctg tccgaacacg gcttccataa 

651 aaggcgtata gggttcgata ttcggggtta a 

This corresponds to the amino acid sequence <SEQ ID 764; ORF 217.ng>: 

g217.pep. . 

1 MADDGLLRQL SEKPSQSALF LPFDPFVFEV LDCLLVI GPG LKQCFKQIPA 

51 TRHPFADRRR LPPYPGNIRQ GFEEGGKTSE HGGLIHVGIP RADILPHRVA 

101 AFGQHPAQYH TVCRLLPGKQ LIRAAAHFSV QSPMDVQIGN HIVQKRQIVP 

151 GQSETAQHGR GFQKREHFAD FKTAFQQVGH ALQRIKKRLP AADFHVRNGI 
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201 RQCLRAGIiRL SEHGFHKRRI GFDIRG* 

The following partial DNA sequence was identified in A'', meningitidis <SEQ ID 765>: 

m217 . seq 

1 ATGGCGGATG ACGGTGTGCG GCGGCAACTG TCCGGAAAAT TGCGCCAATT 

51 CGGTTTCCGC CTrCCATTTG ACCCATTCGT TTTCAAGGTT TTGGACTGAC 

101 TTTTGGTCAT CGGCTTCAGC TTGGAACAAT GTTTCAAGCA AATCCCGGCA 

151 ACGCGCCACC CATTCGCCGA CCGTTGCGGG CTGCCGCCAT ATCCGTACAA 

201 TATCCGTCAG GGTTTCGAGG AAGGCGGCAA AACGTCCGAA CATGGCGGTT 

251 TGATTCACGT CGGCATACCA CGCGCTGACA TCCTGCCACA TCGGATTGCC 

301 GCCTTTGGGC AGCATCCAGC CCAATATCAT GCGTTCTACC GCCTGCTTCC 

351 AGGTGAACAG CTGATCCGTG CCGCCGCGCA TTTCTCCGTC CAAACCCCAG 

401 TGGACGTTCA AATCGGCAAC CATGTCGTGC AAAAGCGGTA AATCGTCCTC 

451 AGTCAGTCCG AAACGGCGCA ACACGGGCGC GGTTTCTAAA AGCACAAGCA 

501 CTTTATCGAC TTCAAATCGG CTTTCCAACA AGTCGAACAG GCATGACAAA 

551 GCATGAAACA GCGGTTGGCG GCGGCTGATT TTCACGTCTG ACACGGAATA 

601 CGGCAATGCC TGCGCACCgG GctGCGCCTG TCCGAACACG GCTTCGATAA 

651 AAGGCGTATA GGATTCGATA TTCGGGGTTA A 

This corresponds to the amino acid sequence <SEQ ID 766; ORF 21 7>: 

m217 .pep 

1 MADDGVRRQL SGKLRQFGFR LPFDPFVFKV LDXLLVI GFS LEQCFKQIPA 

51 TRHPFADRCG LPPYPYNIRQ GFEEGGKTSE HGGLIHVGIP RADILPHRIA 

101 AFGQHPAQYH AFYRLLPGEQ LIRAAAHFSV QTPVDVQIGN HWQKRXIVL 

151 SQSETAQHGR GFXKHKHFID FKSAFQQVEQ AXQSMKQRIiA AADFHVXHGI 

201 RQCLRTGLRL SEHGFDKRRI GFDIRG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 217 shows 80.5% identity over a 226 aa overlap with a predicted ORF (ORF 217.ng) 
from K gonorrhoeae: 

m217/g217 



10 20 30 40 50 60 

m2 17 . pep MADDGVRRQLSGKLRQFGFRLPFDPFVFKVLDXLLVIGFSLEQCFKQIPATRHPFADRCG 

IMIh MM I I IIMIMhIM MMI :hMMMMMMMM 

g217 MADDGLLRQLSEKPSQSALFLPFDPFVFEVLDCLLVIGPGLKQCFKQIPATRHPFADRRR 
10 20 30 40 50 60 



70 80 90 100 110 120 

02 17 . pep LPPYPYNIRQGFEEGGKTSEHGGLIHVGI PRADILPHRIAAFGQHPAQYHAFYRLLPGEQ 

MMI MMMMMMMMMMMMIMMMhMMMMMh MMIM 

g217 LPPYPGNIRQGFEEGGKTSEHGGLIHVGIPRADILPHRVAAFGQHPAQYHTVCRLLPGKQ 
70 80 90 100 110 120 



130 140 150 160 170 180 

m2 17 . pep LIRAAAHFSVQTPVDVQIGNHWQKRXIVLSQSETAQHGRGFXKHKHFIDFKSAFQQVEQ 
llllllllllhlHIIIIIhllll II :||||||||||| |::|Mlhlllll = 
g2 1 7 LIRAAAHFSVQS PMDVQIGNHIVQKRQIVPGQSETAQHGRGFQKREHFADFKTAFQQVGH 

130 140 150 160 170 180 

190 200 210 220 

m2 17 . pep AXQSMKQRLAAADFHVXHGIRQCLRTGLRLSEHGFDKRRIGFDIRGX 

I I MMI MMM MMMMMMMMM MMMIMI 

g2 17 ALQRI KKRIiPAADFHVRNGIRQCLRAGLRLSEHGFHKRRIGFDIRG 

190 200 210 220 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 767>: 

a217 . seq 

1 GTGGCGGATG ACGGTGTGCA GCGGCAACTG TCCGGAAAAT TGCGCCAATT 

51 CGGTTTCCGC CTGCCATTTG ACCCATTCGT TTTCGAGGCT TTGGACTGCC 

101 TTTTGGTCAT CGCCTTCGAC TTGGAACAAT GTTTCAAGCA AATCCCGGCA 

151 ACGCGCCACC CATTCGTCAA CCGTCGCAGG TTGCCGCCAT ATCCGTACAA 



